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CORRELATIONS OF MENTAL ABILITIES* 

I. The Problem and Its Importance 

What constitutes general intelligence? How can we measure 
its amount? These are questions of immense practical impor- 
tance as well as of theoretical interest. Men in every line of 
activity are called upon every day to pass judgment upon the 
mental capacity of individuals and of groups. In many cases 
a choice must be made between a number of applicants of vary- 
ing degrees of capacity and fitness. Other things being fairly 
equal, the matter of prime importance for the judge to discern 
is the general mental ability of each of the persons in ques- 
tion. This judgment must be made in one of three ways: (i) 
by the examinations the candidate has passed and the certificates 
he has gained as a result of definite study; (2) by the opinions 
or recommendations concerning the candidate, given by those 
who know him and his work; (3) by the general impression 
gained from the way the candidate conducts himself during the 
course of the interview. We shall not dwell upon the inade- 
quacy of these tests as a means of determining the general in- 
telligence of an individual. The first, at best, gives a measure 
of the candidate's attainments along the lines tested, and only 
indirectly and secondarily gives an indication of his ability. The 
second and third are subject to all the inaccuracies of unscien- 
tific and ill-grounded personal opinion. Much as we need to 
get the right people into the right places, comparatively little 
has been done to replace these empirical methods by scientific 
ones. 



*The problem of this research was suggested and outlined by Professor 
E. L. Thorndike, and indebtedness is cheerfully acknowledged to him for 
a teacher's guidance and help in every difificulty. The work as carried out 
has been somewhat less comprehensive than that originally suggested. 

Grateful acknowledgment is also due the seventeen professors and stu- 
dents of Teachers College who acted as members of the " Good " group 
of subjects, to Miss Rusk for assistance in scoring a number of the 
records, and to Dr. Whitley. 

For the conclusions stated, the writer alone is responsible. 

I 



2 Correlations of Mental Abilities 

The same holds true with regard to school determinations of 
ability. Certificates, degrees, and the like, of all grades of im- 
portance are given on the basis of demonstrably inadequate 
measures of mental capacity or amount of training, and later 
offered as valid measures of either or both. Students of educa- 
tion have felt the inadequacy of the old time methods to 
diag[nose and measure with any degree of accuracy the real abili- 
ties of the pupil, and students of psychology have, beginning 
with Galton, been devising tests of mental capacities both special 
and general. This work has been summarized in Whipple's 
"Manual of Mental and Physical Tests" ('lo), from which 
may be gained a just notion of the range of experimentation, the 
mistakes and improvements, and the present hopeful status of 
intellectual diagnosis by objective tests. The early workers 
along the line of devising mental tests for the measurement and 
diagnosis of general intelligence now see their labors justified 
by practical results. The period of discouragement and tem- 
porary defeat in the use of this method has been passed, and 
the time has come when workers in this field can go ahead wjth 
confidence that in due time results of much practical impqrtance 
will be secured through painstaking and intelligent iny^tigation. 
Once a series of mental tests can be perfected >hat will en- 
able us to determine the nature and amount of ^ person's mental 
capacity with a fair degree of accuracy, a cprner stone will have 
been laid toward the foundation of a science of education. As 
yet we are not in a position to do justice either to the exception- 
ally bright or the exceptionally dull pupils, to say nothing of 
pupils of smaller degrees of variation from the average. We 
have as yet perfected no scientific method of picking out excep- 
tional children, and until we have adequate means of doing this, 
we cannot expect to have their respective needs properly pro- 
vided for. 

But can we hope to find the means of classifying pupils in 
this way, according to the degree of their intelligence ? The an- 
swer to this question is to be found in the results that have al- 
ready been achieved by the use of even such imperfect tests as 
the Binet-Simon Tests of Intelligence. Already they are being 
successfully used and widely adopted in schools for the feeble- 
minded, to determine the mentality of the subject and the conse- 
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quent treatment. They have been adopted in New Jersey as 
a means of diagnosis for retarded children. They have been 
used in courtroom procedure in New York City for the pur- 
pose of ascertaining the mental status of a youthful criminal, 
with a view to determining to what extent he should be held 
accountable for his conduct, and the sort of education he should 
subsequently receive. And this is only a crude beginning. In 
America alone the work of perfecting and extending such tests 
is being energetically pushed forward by Goddard in New 
Jersey, Wallin in the University of Pittsburgh, Huey in Johns 
Hopkins University, Terman in California, and others. Similar 
tests are being used elsewhere, such as the Sante De Sanctis 
tests in Italy. The more attention is given to studying the tests 
themselves, and developing new tests of greater convenience, re- 
liability and significance, the greater will be the practical educa- 
tional results. 

This monograph is a contribution to knowledge of individual 
differences, especially in that complex of qualities which we call 
' general intelligence,' and of the means of measuring them. Its 
special purpose is to determine the significance of certain tests 
by showing what relations they have to one another and to gen- 
eral intelligence in its common meaning. On the other hand, 
variations in general intelligence, as imputed to this, that and 
the other individual by the world's judgment, will themselves be 
better understood by the discovery of their relations to achieve- 
ment in these tests. 

II. General Method of the Investigation 
I. Abilities tested 

The general method of the investigation was to select tests 
of a variety of mental. abilities, which may be grouped roughly 
under six headings, namely, sense-discrimination, motor-control, 
efficiency in perception, efficiency in association, memory, and 
what, for lack of a better term, I shall call abstraction or selec- 
tive thinking. Of course, all of these abilities may involve com- 
mon elements to a greater or less extent. On the other hand, 
each test is a test of a specific thing. For instance, the tests of 
perception are tests of a specific and limited kind of perception, 
under particular and limited conditions — to mark the A's or 
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the B's on a blank like that on page 112, and to mark the hexa- 
gons and halfcircles on a blank like that on page 113. If the 
test turns out to be a reliable one, that is, if two trials of the 
same test correlate highly with each other in a group of persons, 
it is evident that something definite in the way of ability has 
been tested. Whether or not the ability tested is accurately 
named, is a matter of secondary consideration at the outset. For 
instance, if we give two trials of the "A test " (marking as rap- 
idly as possible all the A's on a printed page), to each member 
of a group of a hundred persons, and find that the order of ex- 
cellence in the first trial is closely the same as it is in the second 
trial, — or in other words, that the two trials correlate very 
highly with each other — it is evident that the records secured 
are measures of some definite ability, even though we may not 
be sure that that ability is accurately named efficiency of per- 
ception.^ 

Again if two trials with the Easy Opposites test should prove 
similarly to test some one and the same thing, and if there 
should turn out to be a positive correlation of 50 per cent or more 
bttween the A test and the Easy Opposites test, it is evident that 
we have measured a mental relationship between different abili- 
ties, even though we may not yet be able to say with scientific 
accuracy that one is a test of perception and the other a test 
of association. However, the accurate naming of general abili- 
ties should be aided by the gaining of measures of relationship 
between different mental tests. For instance, if the A test and 
the Geometrical Forms test (marking all the geometrical forms 
of a certain kind on a page of printed geometrical forms), cor- 
relate almost as highly with each other as two records of the 
A test correlate with one another, it is evident that the two 
tests — A test and Geometrical Forms test — measure very much 

* The experimental work was done in 1907, but for various reasons pub- 
lication has been delayed. Since the writer undertook the investigation 
important studies of a similar sort have been made — notably those of 
Thorndike ('09), Burt ('og), Bonser ('10), Whipple ('10), Whitley ('11), 
and Woodworth and Wells ('12). If these studies had been available then, 
the writer could have altered and much improved the tests which he used. 
But at that time the work on the significance of tests was practically only 
that summarized by Thorndike ('03), and Spearman ('04), all done before 
the discovery of ' attenuation ' by the variability of the result of a few 
trials from that true for an individual on the whole; and two studies by 
Spearman ('04) and by Krueger and Spearman ('06). 
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the same ability, and that they are properly classed under the 
same general heading, in this case perception. 

2. Tests 

The tests used, fifteen in all, are described in detail in the ap- 
pendix on pages 112 to 122. They included two of perception — 
marking A's and marking geometrical forms ; three of memory 
— ^memory of unrelated words (auditory), memory of passages 
(auditory), and recognition of twenty-five forms studied, 
amongst fifty shown later; four of association — addition, easy 
opposites, associating words with hieroglyphic forms in pairs, 
and adding letters to make ba, ca, . . . be, ce, etc., into 
words (referred to as the ba or Completing Words test) ; three 
of selective thinking — ^the hard opposites, the Ebbinghaus or muti- 
lated text, and the absurdities test; two of sense discrimination 
— drawing lines each equal to a given length, and estimating the 
comparative lengths of pairs of lines'; and one of motor control 
— the scroll test. Most of these tests had been used at Columbia 
University. 

3. Subjects 

As to the persons selected as subjects to be tested, the general 
plan was to take two groups of adults, representing as far as 
possible the two extremes of ' general intelligence ' as judged 
by the world. The group representing the high grade of intel- 
lectual efficiency — hereafter called the Good group — was made 
up of seventeen professors and advanced students of Columbia 
University. That the persons making up this Good group repre- 
sent a high degree of mental ability and efficiency is evidenced 
by the number who have since attained high positions in the 
teaching profession. At least five of the thirteen who were then 
graduate students have since secured university positions, and 
others are holding positions almost, if not equally, responsible. 

It is safe to say that if an omniscient judge should rank the 
half million teachers of the country in order for ' general intelli- 
gence,' at least three of the seventeen would be put in the high- 
est hundredth of them, and at least ten, very likely all, of the 
seventeen, in the highest tenth of them. It is also the case that 
the seventeen would rank far above the average man in the 
management of affairs. 
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Those m the group representing the low grade of efficiency 
were twenty in all, selected from men in New York City who 
had never held any position demanding a high grade of intelli- 
gence. All were of mature age and fairly comparable in that 
respect with the members of the Good group. In order that 
none should be at a decided disadvantage in the language tests, 
none were selected who did not speak English as their mother 
tongue. Two of them were persons earning comfortable livings 
for their families, but men recognized by their associates as 
being dull. Eleven others were staying at the Salvation Army 
Industrial Home at the nominal salary of $i per week in addi- 
tion to board and room, until work could be secured. One of 
these held a somewhat responsible position at the time, acting 
as assistant superintendent of the Home. He stood high in the 
most significant tests. The subject who did poorest of all was a 
man twenty-four years of age who had been at the Home for 
four years, and was quite content to remain there indefinitely 
on the permanent wage of $i a week besides board and lodging. 
The remaining seven were found in a mission on the Bowery 
where they were being helped somewhat until they could find 
employment. Altogether there were thirty-seven subjects, seven- 
teen in the Good group, and twenty in the Poor group. 

From the way in which those in the Poor group were selected, 
it is clear that they represent persons of a low grade of intel- 
lectual efficiency. They are, with the two exceptions noted 
above, persons who for some reason were out of employment. 
In some cases, no doubt, the lack of employment may have been 
partly due to habits of intemperance, but this defect would in 
most instances be insufficient to cause discharge if the man were 
otherwise above the average in efficiency. Nor would such cases 
be common in these religious establishm.ents. In my opinion 
these twenty men were as temperate in respect to alcohol as the 
average New York male. 

Most of the Poor subjects had worked at several different 
occupations. The facts, as far as obtained, were as follows: 
Brakeman, clerk, timekeeper; soldier, cook; teamster; wood 
worker, help on boats, hostler; machinist; worker in silk fac- 
tory, insurance broker, janitor; clerk, office help; shoe-laster; 
painter; clerk, teamster; help in customs house, night watch- 
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man, checker on ship; hand in shoe factory; blacksmith, crock- 
ery business; salesman, collector of accounts; cook, mechanic; 
seaman, general help, cook, farm laborer ; foreman of milk busi- 
ness; wagon painter; farmer's help, employee in knitting fac- 
tory ; printer's help, driver of express wagon. 

The three who seemed the slowest and dullest of all, were 
the two who had regular employment but had been mentioned 
to the writer as being considered dull by their associates — ^the 
foreman of the milk business and the wagon painter — and the 
young man of twenty-four who was quite content to stay indefi- 
nitely at the Salvation Army Industrial Home. 

4. Method of giving the tests 

The tests were all given individually in the same order and 
as nearly as possible in the same way. To avoid fatigue, they 
were given at two sittings, except in about three cases with per- 
sons of the Good group, where they were completed in compara- 
tively short time and with no visible signs of fatigue. The time 
taken to complete the tests varied from about three to five or 
six hours. Those doing them most quickly secured on the whole 
by far the best records for accuracy as well. 

The zeal with which the subjects undertook the tests is beyond 
question a matter to be carefully considered. This seemed to 
the writer as satisfactory as could reasonably be expected. In 
the case of the Good group all were eager to do their best be- 
cause of interest in the tests and their results for science. In 
the case of the Poor group, with the exception of two or three, 
the same was true, although perhaps to a somewhat smaller 
extent. At first a little difficulty was experienced in getting per- 
sons of this class to take the tests. Those at The Salvation 
Army Industrial Home were in some cases at first a little diffi- 
dent about it. They seemed to fear that their mentality might 
be subjected to a rather minute diagnosis. However, as soon 
as it became known at the Home that all of those selected were 
to be paid a certain amount " to complete the course," and that 
there was nothing very extraordinary about the tests, they were 
willing to do their best to give satisfaction, and also were eager 
to make good records. Only two of the whole thirty-seven 
(Good and Poor) seemed to find the tests irksome before the 
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finish, viz., number thirty-five who complained that his eyesight 
handicapped him, and number thirty-seven who showed httle 
evidence of interest in any of the tests. This was probably due 
to the fact that he had little interest in anything at all intel- 
lectual. There is no doubt that number thirty-five was handi- 
capped in most of the tests, and especially in those of perception, 
by poor eyesight; and possibly physical fatigue influenced his 
results, as he had worked all day at his trade of wagon-painting. 
A number of those on the Bowery seemed to do their best because 
of an idea that the discovery of unexpected talent might lead to 
the offer of a good position. 

III. The Administration of the Tests in Detail 

1. Order 

A copy of each test used is given in the appendix on pages 
112 to 122. The name used hereafter in speaking of the test 
is also given there, as well as brief general directions to the sub- 
ject about to take the test. The reader is therefore advised 
to turn to the appendix at this point for this information. The 
order in which the tests were given is the order in which they 
follow each other in the appendix. Some of the easier tests, 
such as those of perception, were given first so that the Poor 
subjects might not become discouraged at the outset. 

2. Instructions 

As far as possible, the instructions for any particular test 
were the same for all subjects. But on account of the wide 
differences between some members of the Poor group and the 
members of the Good group, it was necessary to give more de- 
tailed instructions to the Poor subjects, to insure that they 
understood beforehand exactly what they were required to do. 

In giving the A test, the subject was allowed to glance at 
the page of printed capitals, and was then told to take up the 
paper as soon as the signal was given, and mark all the A's 
as quickly as possible with a pencil, in any manner most con- 
venient to himself. The time was taken with a stop watch. The 
second trial consisted in giving him a new copy of the test with 
instructions to mark all the B's as rapidly as possible. In the 
second test — Geometrical Forms — ^the instructions were similar, 
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except that special pains were taken to make sure that the sub- 
ject understood clearly what sort of geometrical form was to 
be marked. To insure this, he was allowed to glance for a mo- 
ment at the printed slip, and then told to mark only the hexagon 
with the point up. This was further illustrated by drawing the 
hexagon to be marked on a separate piece of paper. In the 
second trial it was similarly made clear that the figure to be 
marked was the half-circle with the ftat side up, and not the 
half-circle in any other position. In taking the Scroll test of 
motor-control, the same fountain pen was used by- all of the 
subjects. The directions given for the first trial were: " Trace 
the white part of the spiral as rapidly as possible without touch- 
ing the black part." As the time taken for the first trial varied 
to a considerable extent, on the second trial the subject was re- 
quested to go either faster or slower than before so as to make 
the time occupied in the second trial about three minutes. 

In the Easy Opposites test, the subject was required to give the 
opposite of the printed word orally, instead of by writing it, 
so that the test would be a measure of quickness of association 
rather than a measure of quickness of writing. The instructions 
were: "As quickly as possible give orally a word that means 
the exact opposite of the word in the list. Thus if you see the 
word small, say large, and so on down the list." 

In the Recognizing Forms test, the subject was allowed to 
glance for a moment at the two sheets. It was then explained 
that he would be given one minute in which to study the small 
sheet, and that at the end of that time, he would be asked to 
mark on the large sheet only those forms that were exactly the 
same as the ones he had seen on the small sheet. In both cases 
subjects were given all the time they desired to mark the forms 
on the large sheet. 

In giving the Memory of Words test, it was explained that 
a list of simple words would be read aloud once, and that as 
soon as the last word was heard, the subject was to write down 
in any order as many of the words as he could remember. 

In the Pairs test — connecting a word with a hieroglyphic form 
— the subject was allowed to glance at a sheet containing the 
forms only, and another sheet with the forms and words in 
pairs. He was then told that he would be given one minute in 
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which to study the second sheet, so that when the forms only 
were given he could write down for each, the corresponding 
word. In the last two of the four trials given, one and a half 
minutes were allowed in which to study. 

In the Memory of Passages test, care was taken always to 
read at a uniform "Tate and as distinctly as possible. Two or 
three of the Poor group complained that they could not spell 
well enough to write down the substance of the passage heard. 
In such cases the experimenter took down the substance as the 
subject gave it orally. 

In giving test IX — Drawing Lengths — the subject was allowed 
to draw each line on the page of a ruled exercise book, but 
each of the lines he drew was covered up before drawing the 
next line, to insure that each drawing was an estimate of the 
original length and not a copy of the first line drawn. 

In giving test X — Estimating Lengths — two horizontal lines 
of nearly equal length were drawn side by side on a piece of 
cardboard. They were heavy lines drawn with a drawing pen 
and India ink, so that they could be clearly seen and easily com- 
pared. There were four sets of cards, with eight exactly alike 
in each set. In the first set the difference in length in the two 
lines was 8 mm., one lOO mm. and the other io8 mm. In the 
second set the lines were loo and io6 mm. respectively; in the 
third set loo and 104 mm., and in the fourth set 100 and 102 
mm. In giving the test each cardboard was shown in turn to 
the subject, and he was asked simply, " Which is the longer of 
the two lines?" The whole test was then repeated, so that in 
all there were 16 estimates of each of the four lengths of this 
test. The subject was allowed all the time he wished to decide 
in every case, but little seemed to be gained by much hesitation 
in deciding. 

In test XI, Addition, the subject was allowed to glance at 
the addition examples, and was instructed to turn over the paper 
and start adding as soon as the signal was given, to add as 
quickly and accurately as possible, putting down the results as 
he proceeded. 

In test XII, Hard Opposites, the instructions were the same 
as in the Easy Opposites test, except that the subject was told 
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to write down the opposite of the printed word, instead of giving 
it orally. Unless the word was utterly unknown to him, he was 
to take pains to write down the most accurate opposite he could. 

In test XIII, Completing Words, or " ba test," besides the 
general directions indicated in the appendix, the subject was 
shown by an example what was to be done. 

To insure clear understanding of what was to be done in 
the Mutilated Text or Ebbinghaus test, a special sample was 
shown, and the blank spaces were filled in for him with ap- 
propriate words. 

In giving the last test. Absurdities, the instructions were, 
" Mark each sentence that contains an absurdity or impossibility. 
For instance, if a sentence stated or implied that lead was float- 
ing on water, mark such a sentence as absurd or impossible. Do 
not mark the sentences that contain no absurdity or impossi- 
bility." 

3. Individual differences in ability to interpret instructions 

Before giving the tests to any of the thirty-seven subjects 
whose records were secured, the experimenter took the precau- 
tion to practice himself in giving the tests to other persons, both 
for the purpose of acquiring the necessary skill in giving the 
tests in a uniform manner, and in order to determine the specific 
instructions necessary to be given to the subjects. Every 
reasonable precaution was taken to see that the instructions were 
understood before the test was begun, but in spite of this fact 
several of the poor subjects failed to follow instructions ex- 
actly. In some instances this seemed due to the fact that, though 
the instructions were understood when given, a part of them 
were forgotten as the work proceeded. In marking geometrical 
forms — test II — several of the Poor group started in by mark- 
ing only the hexagons with the point up, but before they got 
half way down. the page were marking all hexagons — with flat 
side up as well as with the point up. As the test was intended 
to be one of quickness in picking out forms and not one in 
ability to understand and remember instructions, they were not 
severely penalized for this.^ In three or four other cases it 
seemed fairly evident that the instructions were not perfectly 
' See later account of methods of scoring. 
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grasped before beginning the first trial, even though both sub- 
ject and experimenter thought that they were. For instance 
three of the dullest of the Poor group in doing test XIV, Ebbing- 
haus mutilated text, put several words in a single blank space 
where the sense seemed to them to require it, although they 
were distinctly told beforehand to put only one. Thus incident- 
ally the tests gave the experimenter an opportunity to form an 
estimate of each subject's ability to comprehend instructions. 
No attempt has been made to correlate this accurately with 
abilities as indicated by the tests themselves, but from memory 
of individual cases the writer is very confident that in general 
those who were dullest in comprehending and remembering in- 
structions were also poorest in the tests which later results 
proved to be most closely correlated with intelligence. In two 
cases a zero record — in test VII, Learning Pairs, — ^in the first 
of the four trials resulted from the fact that the subject took it 
for granted that the figures would be arranged in the same 
order on both the study slip and the test slip. He thus at- 
tempted to learn only the words in their correct order, instead 
of connecting the form with the corresponding word. How- 
ever, irregularities of this kind were of such slight consequence 
as not to interfere to any appreciable extent with the general 
results. 

IV. Scoring of Results 
I. General principles 

Any method of scoring the results of tests such as these must 
be more or less arbitrary. The method of scoring finally adopted 
was that which seemed fairest on the ground of common sense, 
and that which seemed to vary as little as possible from the re- 
sults secured by other reasonable methods of scoring. The 
original results which would take some scores of pages to 
print are all on file at Teachers College, so that any one who 
cares to do so can compare the final scores adopted here with 
those obtained by any other method of scoring that seems to 
him advisable, or test any of the writer's conclusions by com- 
puting the results by the different scoring. 
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2. Method of scoring for each test, and reliability of the score 

given 

The final score in the A test was got by taking the time in 
seconds required to complete the test, plus five seconds for each 
letter omitted. Thus the lowest score represents the greatest 
efficiency. As the average time for marking each of the fifty 
A's was not much under three seconds, it seemed fair to penalize 
each omission by something more than three seconds. On the 
other hand, if much more than five seconds were added for each 
omission, it would emphasize care and accuracy rather than 
quickness of perception. Moreover, it made little difference 
whether three, four, or five seconds were added for each omis- 
sion, judging by the average displacement of rank of the dif- 
ferent subjects. The average displacement in rank for each sub- 
ject owing to the difference in scores when four seconds are 
added for each omission, and when five are added, is only a 
trifle over one. The average displacement in case of adding 
three seconds for each omission and in adding five seconds for 
each omission is only two. Although this test is satisfactory in 
reliability, the average displacement in rank between the first 
trial (marking A's) and the second trial (marking B's) is 
slightly more than five. Thus the method of scoring adopted 
seems quite satisfactory. There were no errors of any kind in 
this test except omissions. 

For similar reasons the method of scoring the Geometrical 
Forms test was to take the number of seconds and add three 
for each omission in the first test, and six for each omission in 
the second. There were more than twice as many hexagons 
to be marked as half-circles. Hence it took about twice as long 
on the average to find and mark each half -circle as to find and 
mark each hexagon. Nothing was taken off for forms wrongly 
marked, since they were already penalized somewhat in that it 
took some time to mark the wrong form, and since if much were 
deducted for this, it would amount to making the test one of 
ability to understand instructions rather than one in quickness in 
picking out forms. 

The score in the Scroll test was got by taking the time in 
seconds and adding ten seconds for each touch of the black 
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lines. Ten was selected as the number to be added for each 
touch, mainly because it equalized the scores of each subject 
in the two trials better than any other number. For instance, 
if five seconds instead of ten were added for each touch, the 
score in the second trial would be better than the score in the 
first trial in twelve cases, and poorer in twenty-five. If ten 
seconds were added for each touch, the score in the second 
trial would be better in eighteen cases, and poorer in nineteen 
than the first. If nine seconds were added, the second trial 
would be poorer in seventeen cases and better in twenty than the 
first. 

How much, if anything, should be allowed for improvement, 
it is very difficult to say, as it was not possible to keep either 
the time or the number of touches uniform in the two trials. 
Neither is it very helpful to estimate the average number of 
seconds that is equivalent to avoidance of one touch, as the 
variation is so very wide. In fact it seems difficult to find any 
system of scoring this test as given, that is altogether justifiable. 
On the whole it seems best to assume that the two trials are 
tests of the same thing, and that therefore that method of scor- 
ing is fairest which on the whole makes a given subject's first 
score and his second score as nearly equal as possible. Adding 
ten seconds to the time for each touch does this. 

For scoring the Easy Opposites test, the experimenter had 
kept track of the number of words that were correctly given, 
the number incorrectly given or omitted, and the number that 
could be considered half right. The score taken was the time 
in seconds, plus two seconds for a word half wrong, and four 
seconds for a word wrong or omitted. Notes were made at the 
time the test was given, of how much time a subject took to 
think up a satisfactory opposite for the one or two words that 
caused most hesitation or difficulty. This formed the basis for 
estimating how much on the average it was most just to add 
to the subject's time in seconds as a penalty for errors or omis- 
sions. 

In scoring test V, Recognizing Forms, one mark was allowed 
for each form correctly marked, and one was taken off for 
each form incorrectly marked. On this basis, if the subject 
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were merely guessing, his score would on the average approxi- 
mate zero. 

In test VI, Memory of Words, the score given was simply 
the number of words correct. Nothing was taken off in case a 
wrong word was written down. This, later study has shown to 
be unwise, but there were only a few such wrong words inserted 
and, as the labor of recalculating correlations and group dif- 
ferences would be very great, the scores have been left as origin- 
ally niade. 

In test VII, Learning Pairs, the score given was simply the 
number of words right, except that there were a few instances 
where credit was given for a word half right, where the word 
was connected with the proper form but was not itself exactly 
correct. 

In scoring test VIII, Memory of Passages, the experimenter 
evaluated all of the records on a scale of from one to twenty- 
five. A month later he rated them independently a second time. 
An assistant in psychology also rated them independently. It 
was found that the three evaluations differed very little from 
each other. The average displacement in the rank of each sub- 
ject, according to whether he was ranked on the scoring of the 
assistant or on that of the experimenter's first scoring, was only 
about 2. The average change in rank between the experi- 
menter's first and second scorings was 2.3. As compared with 
this the average displacement of each subject in rank, between 
his score in the first two tests and his score in the last two, was 3. 

The score adopted in test IX, Drawing Lines, was the sum of 
the deviations plus and minus from the standard length. The 
constant error was separated out for study, but seemed to be 
of little significance, and so the deviation from the standard set 
was used. Out of the seventeen Good subjects only eight had 
a clearly positive error, and only four a clearly negative one. 

In test X, Estimating Lengths, each individual's number of 
correct judgments of each difference was recorded, and the indi- 
viduals ranked in order of merit accordingly. 

In scoring test XI, Adding, the time in seconds was taken 
and ten seconds added for each error. It was considered that 
the penalty should be estimated by the time that would probably 
be required to correct the error. This, in general, would be at 
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least somewhat more than was required to do that part of the 
sum. On this basis at least five seconds should be added for 
each error and probably more, as the median time for doing a 
whole question, with a possibility of two errors, was about ten 
seconds. Whether six seconds or ten seconds is added for each 
error, makes very little difference; the average displacement in 
rank caused by this difference in scoring being only slightly over 
I. The difference in rank between a subject's score in his first 
trial and his second is on the average, 3.8. On the whole it 
seemed that ten seconds was a fair penalty for each error. 

In test XI, Hard Opposites, all of the words given as op- 
posites were collected and evaluated on a scale of from i to 4 
by three different persons, viz., two assistants in psychology and 
the experimenter. The three evaluations differed very little 
from each other, the average displacement in rank caused by 
using one rating rather than another being only slightly over i. 

The final score given to each subject was got by averaging the 
marks of the three different scorers, and then adding a certain 
number of seconds to the time score, as a penalty for each word 
wholly wrong or omitted, and a proportionate amount for a 
word partly wrong. On the basis of the time actually spent by 
a number of subjects to think up suitable opposites for the most 
difficult words rather than omit them, 36 seconds seemed to be 
about the right penalty for the total omission of a word. 

Other methods were tried, but none was found which on the 
whole seemed to equate the two factors of equality and speed so 
fairly, especially with reference to those who did best and 
poorest. For instance, an attempt was made to equate the time 
element and the quality element by combining the quality score 
with a reciprocal of the time score. This suggests itself on the 
general principle that if A does twice as much work as B in a 
given time, he deserves a score twice as high; if A and B do 
the same quality of work, and A takes twice as long as B, he 
is worth only half as much. The variations in the time, how- 
ever, were so great that such a method could not be fairly ap- 
plied without the use of complex mathematical calculations. 
Moreover it would assume that all of the words are equally diffi- 
cult, which of course is not the case. 

For somewhat similar reasons it was difficult to find a per- 
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fectly satisfactory method of scoring in test XIII, Completing 
Words, as in several cases there were a number of omissions 
whereas in other cases subjects had spent times varying greatly 
in amount in attempting to avoid omissions. On account of the 
large number of omissions, the second trial was thrown out al- 
together. For a similar reason the records of eight subjects 
were thrown out in trial one. The score given was the time in 
seconds plus 10 seconds for each error or omission. As far as 
could be judged, 10 seconds seemed to be the time that probably 
would have been required to complete the word. 

In reducing the results of test XIV, Ebbinghaus Mutilated 
Text, to a numerical score, they were first marked by the ex- 
perimenter as to excellence in filling out the blanks so as to 
make sense, regardless of the time taken. They were then 
marked in a similar way by an assistant in psychology, and 
then marked independently a second time by the experimenter. 
These three ratings, which did not differ materially from one 
another, were averaged so as to get the final rating for quality, 
i.e., ability to do the test without regard to the time taken. The 
average displacement in rank of each subject according to the 
different scorings was between i and 2. As it seemed just to 
count the time taken to do the test about equally with the 
quality of the work, the final score adopted was a combination 
of the time score and the quality score, weighting quality a little 
higher than time. In doing this the time factor taken was the 
reciprocal of one-fifth of the time in seconds. This method is 
not open to the same objections here as in the Hard Opposite 
test, and succeeds fairly well in equating speed and quality. 
However, the method may seem unnecessarily complicated, and 
could not very well be used except under special and limited 
conditions. It would have been less trouble, and about equally 
fair, to have held to the general method used in tests previously 
mentioned, namely, to take as the final score the time in seconds 
plus a certain penalty in seconds for each mark lost in the score 
for quality. 

In giving the Absurdities test it was assumed that the record 
could be scored simply by allowing i mark for each absurdity 
correctly marked, and taking off a mark for each one wrongly 
marked. This would give a score for quality, with which the 
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time element could be combined to give a final score. It turned 
out, however, that the test as given was open to decided objec- 
tions, on account of containing a number of imperfections. Im- 
provements in the test will be discussed in another connection. 
As far as giving definite scorings was concerned, the test was 
finally thrown out. If, in spite of these imperfections, the re- 
sults are scored as originally intended, even regardless of the 
time element, it divides the two groups fairly well. 

The scores finally awarded to each individual in each test are 
given in Tables la to In inclusive. Each subject's total score in 
each test, and the scores in ist and 2nd trials of the test, are 
given in Table II. Table III gives the rank of each individual 
in each test. 
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TABLE II (continued) 
Total Scores, and Scores in 1st and 2nd Trials 
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TABLE II (continued) 
Total Scores, and Scores in 1st and 2nd Trials 
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TABLE III 
Rank of StJBJEcrs in the Different Tests 
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V. The Reliability of the Tests 

I. The two different methods used in treating correlation data. 

In all the discussions of correlations two different methods 
are used. First : Each measure is used in the form of a devia- 
tion, plus or minus, from the central tendency of the Good group 
in that trait, or of the Poor group if the individual in question 
belongs to that group. The coefficients of correlation so ob- 
tained measure the relation between (i) an individual's devia- 
tion in one test from the median of his group (the Good or the 
Poor) in that test and (2) his deviation in some other test from 
the median of his group in that other test. Second: Each 
measure is used in the form of a deviation, plus or minus, 
from the median of the entire thirty-seven individuals in that 
test. This median is approximately the central tendency for 
adult men the country over in the trait in question, since the 
Good and Poor represent, in tests correlated with intelligence, 
a group of seventeen cases far above that central tendency, and 
a group of twenty cases probably not quite so far below that 
central tendency. The coefficients of correlation by the second 
method then represent approximately the relation between (i) 
an individual's deviation in one test from the central tendency 
of all adults in that test, and (2) his similar deviation in another 
test, — ^but in the case not of a random group but of a group 
chosen from, say, the top and bottom ten per cent for general 
intelligence. 

This difference is illustrated in the case of the A test and 
Geometrical Forms test by the table following : 
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In calculating the correlations for the Good group alone, the 
Poor group alone, and both groups together, the deviations of 
each individual were measured from the following as central 
tendencies : 



Median of 
Good GRorp 



Median of 
Poor Group 



Median of 
Good and 

Poor 
Together 



Memory of Passages 
Memory of Words . . 

A test 

Adding 

Learning Pairs 

Recognizing Forms . 
Ebbinghaus test. . . . 
Drawling Lengths. . . 
Estimating Lengths. 

Scroll 

Completing Words. . 

Easy Opposites 

Hard Opposites . . . . 
Geometrical Forms . 



69 

29 

261 

200 

24 

21 

418 

35 

61 

72 

144 

127 

1014 

164 



23 

19 

366 

320 

4. 

9 

209 

32 

58 

128 

332 

295 

2623 

237 



43.5 

23 
299 
240 
9.25 

13 
289 

33.5 

59 

90 

167 

184 

1903 

200 
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The deviation of each individual from each of these medians 
in each test, is shown in tables V a, V b, and V c' 

In calculating coefficients of correlation corrected for attenua- 
tion, it is necessary to have two sets of deviations from the cen- 
tral tendency in each case, one set for the first trial and one for 
the second. These deviations are given in tables VI a, VI b, and 
VI c,^ and are measured from the following as central ten- 
dencies : 



Test I Test II 



Median of 
Good Group 



Median of 
Poor Group 



Test I Test II 



Median of 

Combined 

Groups 



Test I Test II 



Memory of Passages 
Memory of Words . . 

A test 

Adding 

Learning Pairs 

Recognizing Forms . 

Easy Opposites 

Hard Opposites . . . . 
Ebbinghaus test. . . . 

Scroll 

Geometrical Forms . 
Completing Words. . 
Drawing Lengths. . . 
Estimating Lengths. 



31. 

15 

122 

101 

8 

10 

65 

479 

188 

38 

88 

70 

11 

30 



33. 

14 

142 

94 

12. 

10 

60 

518 

227 

34 

83 

71 

14 

30 



12.5 
9 
184 
15 
2.7 
4 
136 
1149 
117 
63 
122 
176 
18.5 
28 



11 
9 
170 
170 
2. 
3. 
134 
1463 
92 
61 
108 
157 
15. 
28. 



20 

11.2 
144 
116 
4 
7.8 

88 
844 
158 

43 
110 

91 

15 

29 



23.75 
11.8 
148 
134 
5.8 
7.8 
86 
1041 
138 
47 
94 
77 
14 
30 



'A sample page only is printed here, the full table being on file in the 
Library of Teachers College, Columbia University. 
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TABLE Va 
Deviations from the Median — Good and Poor Subjects Combined 
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TABLE Vb 
Deviations teom the Median of the Good Subjects Only 
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TABLE VI 

Deviations from the Median — First and Second Trials — All 
Subjects Combined 
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TABLE VII 

Reliability of the Tests, as Shown by the Pearson Coefficients of 
Correlation Between First and Second Trials (Raw Coefficients) 

In the case of each test the heavy-face figure given first is for the Good 
and Poor together, divergences being measured from the median of the 37 
individuals. The second figure is for the Good group, divergences being 
measured from its median. The third figure is for the Poor group. 

92 
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The reader should remember that correlations for Good, for 
Poor, and for Good and Poor together will have these meanings 
throughout. 

2. Detailed discussion of the reliability of each test 

The reliability of any test will be measured by the closeness 
of the correlation between the two different trials with it. Table 
VII gives these " coefficients of reliability " for the different 
tests. Just what the two trials were in each case can be found 
in the appendix on pages iii to 121. 

The reliability of the A test was satisfactory, though not as 
high as could be desired. The correlation between trials i and 2 
was about 66 (72 taking the Good and Poor together, 62 taking 
the Good group alone, and 60 taking the Poor group alone). 
It would have been well worth the extra labor to have taken 
four trials instead of only two, for the sake of the added relia- 
bility of the results. The arrangement of the letters is such as 
to make the test a good one of its kind. 

The Geometrical Forms test seems to be somewhat higher in 
reliability than the A test, the correlations between marking 
hexagons and marking semi-circles being 69 for the Good, 91 
for the Poor, and 90 for both taken together. This may be 
owing to the fact that some members of the Poor group were 
handicapped by lack of familiarity with geometrical forms, mak- 
ing those least familiar with such forms stand lowest in both 
trials. In so far as this is the case, it simply means that what is 
tested is a combination of factors involved in the A test plus 
familiarity with the geometrical forms. 

The Scroll test as given is of doubtful reliability. Taking the 
Good group by themselves the correlation between the first and 
second tests is practically zero. For the Poor group the correla- 
tion is 71, and for both together it is 76. To get reliable re- 
sults from it, at least two or three practice trials should be given 
first so that all could learn to do it at a fairly uniform speed. 
Unless the rate of performing the test is fairly uniform, it is 
probably impossible to find a method of scoring that is entirely 
satisfactory. It is curious that the coefficient of reliability 
should be so much higher in the Poor group than in the Good 
group (71 in the Poor and — 4 in the Good). While conclu- 
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sions as to the cause of this are somewhat uncertain, the figures 
indicate that the test brought out individual differences in the 
Poor group, but not in the Good. What these differences are, 
is the doubtful point. They are probably in part due to dif- 
ferences in practice in handling pens and pencils. It is easy 
to see, however, that there would be a positive correlation in 
these days between intelligence and practice in handling pen and 
pencil, as few people would be in a position to make use of 
average intelligence without some occasion to use pen or pencil. 
It seems fairly probable that the high reliability in the Poor 
group is due also in part to the fact that some did poorly in 
both trials owing to poor muscular control, on account of lack 
of steadiness of hand. These two factors would operate to- 
gether, to make those who did best in the first trial do best in 
the second also. In the Good group, on the other hand, all had 
had plenty of practice in handling pen and pencil, and none 
showed decided lack of steadiness of hand, so that individual 
differences here were mainly in the mode of adaptation to this 
new test. Subjects in the Good group tended to vary their 
method in the two trials much more than the subjects of the 
Poor group, and two trials did not give time enough to allow 
the best method to become fixed in time to materially improve 
the subject's score. Had there been about ten trials taken, the 
advantages of rapid and skillful adaptation to this kind of thing 
would in all probability have had a chance to show themselves 
in the last few trials at least. If so, this would have resulted 
in increased reliability. The test could be shortened about one- 
third with advantage, as in some cases there was evidence of 
fatigue before a trial was completed. It is also somewhat ob- 
jectionable on account of tending to cause eye strain and dizzi- 
ness. 

The reliability of the next test. Easy Opposites, is satisfac- 
tory. The coefficients of correlation between trial i and trial 
2 are: Good 53, Poor 89, all together 93. It appears to be 
a good test of readiness of controlled associations. To persons 
of average ability the words in the lists are familiar from or- 
dinary conversation, and little effort is required to recall a satis- 
factory opposite for each. In so far as little hesitation is neces- 
sary to get any desired opposite, the test is a good measure of 
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rapidity of association. In some cases, however, a word that 
is easy for many may cause considerable loss of time to one or 
two. In the case of a few of the Poor subjects, their facility in 
handling even these words was so slight that the test was for 
them very much like the Hard Opposites test for the more able 
subjects. For those making the poorest records in this test, get- 
ting the proper opposites seemed to call for an exercise of their 
best thinking capacity. They thought of the words in sentences, 
and then of the opposites, and then selected the best by elimina- 
tion of the more unsatisfactory ones, whereas to the best sub- 
jects the correct opposites were suggested with a minimum of 
effort and hesitation. For the Poor subjects this is then a test 
of selective thinking. 

Test V, Recognizing Forms, was not, as given, satisfactory 
in regard to reliability. The coefficients were: Good — 41, Poor 
— 10, all together 40. The variability of the two trials is so great 
as to swamp small differences, such as occur within either group, 
and to show only very crudely the larger differences between an 
individual in the Good, and one in the Poor group. 

Test VI, Memory of Words, was fairly satisfactory as to re- 
liability, the correlations between the average of the first two 
trials, and the average of the last two being: 62 for the Good, 
60 for the Poor, and 72 for all together. The reliability could of 
course be improved considerably by giving eight trials instead 
of four. The labor involved in doing this would be well repaid 
by the gain in reliability. As to the number of words to be 
given, sixteen was undoubtedly so many as to be somewhat dis- 
tracting to several of the Poor group. The number of words 
best adapted to the Good group would probably be too many 
for securing most accurate results from the members of the 
Poor group. On the whole, twelve words would seem to be the 
optimum number. Eleven out of sixteen was the highest score 
secured by anyone. 

Test VII, Learning Pairs, was satisfactory in reliability, the 
coefficients of correlation between the two halves of the test 
being : Good 79, Poor 52, and all together 93. It appears to be 
a good test for a certain type of ability to form and use asso- 
ciations. 

Test VIII, Memory of Passages, showed a correlation be- 
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tween the two halves of the test of 78 for the Good, 83 for the 
Poor, and 90 for all together. The four passages of one hun- 
dred words each were selected with a view to eliminating as far 
as possible specialized individual differences and interests. The 
passages were selected from newspapers, on the presumption 
that people in general are more nearly on an equality of interest 
in the case of that style of reading matter than in the case of 
any other matter that might be chosen. The more the indi- 
vidual was interested in the particular subject matter chosen, 
and the more he knew about it, the better he would be able to 
assimilate and remember it. It is therefore apparent that any 
test of this kind cannot fail to be, more or less, a test of breadth 
of interest, as well as a test of memory. In fact, range of in- 
terest would be one common factor in power to memorize pas- 
sages and in general intelligence. 

Test IX, Drawing Lines, proved fairly satisfactory as given. 
The correlations between the first four trials and the last four 
were: 42 for the Good, 95 for the Poor, and 72 for all to- 
gether.^ It was evident to the experimenter that it was in part 
a test of pains taking and patience. In some cases subjects would 
be undecided as to whether to make any change in the length 
of the line they had drawn, and then on deciding to change it, 
would do so to the extent of 4 to 6 mm. It was evident that 
most subjects did not manage to discriminate length as closely 
in this test as in the test in comparing lengths shown in pairs. 
It would have been rather better to have taken four trials of 
each length instead of only three. 

Test X, Estimating Lengths, was hardly satisfactory in re- 
liability as given. The coefficients of correlation for the first 
half of the test and the second half were: 47 for the Good, 60 
for the Poor, and 48 for all combined. Half of the test as 
given was too easy to bring out individual differences. It would 
have been better to have made the differences i, 2, 3, and 4 mm. 
respectively, instead of 2, 4, 6, and 8 mm. respectively. While 
this method would enable one to get more accurate results as 



' For purposes of comparison, the experimenter also took the trouble to 
calculate the coefficients of reliability in a slightly different way in this 
test. The average of the correlations of the ist trial with the 2nd, 2nd 
with 3rd, and ist with 3rd, gives 68 for the Good group, 86 for the Poor, 
and 82 for the combined groups. 
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to ability to compare lengths of lines than test IX, it must be 
carried out far more extensively than that test, if reliable 
scores on a sufficiently fine scale are to be secured. The diffi- 
culty here, of course, is due to the fact that the use of the 
method of right and wrong cases makes the test a long one, as 
so many trials have to be given. It would doubtless be worth 
the trouble to find out by experiment to what extent scores on a 
basis of relative position from a comparatively short series of 
tests such as those described here, would differ from scores se- 
cured from a long and elaborate series of tests sufficiently ex- 
tensive to justify the exclusive use of the method of right and 
wrong cases in scoring. 

The coefficients in the Adding test were : y6 for the Good 
group, 90 for the Poor, and 91 for all together. The fact that 
the coefficient is slightly higher for the Poor group than for 
the Good is probably to be explained by two reasons. First, 
there was more variation in amount of practice in adding in the 
Poor group; second, in general in all tests correlated with intel- 
lect there was more variability in the scores of the Poor group 
than in those of the Good group. In other words, the Goods 
were a more homogeneous group in respect to addition than the 
Poors. 

In the Hard Opposites test, the coefficients of correlation be- 
tween the two halves of the test were : 60 for the Good, 88 for 
the Poor, and 97 for all together. As in the Adding test, the 
reliability was considerably higher for the Poor group than for 
the Good. It is a good test, though it could no doubt be im- 
proved somewhat by leaving out some of the most difficult 
words, and by carefully selecting lists of words more nearly 
equal to one another in difficulty. This would be by no means 
easy to do, as words which are familiar to many may chance 
to be little used by a few. However, a word whose opposite is 
so difficult for most people as the word unless had better be 
omitted, as a large number omitted it altogether, while others 
spent much time in thinking out its opposite; and this makes it 
very difficult to score the results in a way that is just to both. 
Different tests of varying degrees of difficulty would be well 
worth devising and perfecting. With the least efficient members 
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of the Poor group the test became one of range of vocabulary 
rather than of abihty to think up different opposites. 

Text XIII, Completing Words, or ba test, was not satisfactory 
in the case of the Good group. The coefficients of correlation 
between the two halves of the test were : 27 for the Good, 89 
for the Poor, and 92 for all together. On account of the greater 
variability of the Poor group, a less perfect test serves to bring 
out their comparatively marked individual differences, whereas 
on the other hand in a group that is more highly selected, and 
whose individual differences are not so pronounced, they may 
be easily obscured or covered up by " chance variation " due 
to imperfections in the test itself. Thus a member of the Good 
group might have a relatively high difference between his score 
in the first and second trials, owing to hesitation over the com- 
pletion of one or two words, while in the Poor group, the same 
amount of hesitation would not have nearly so much influence, 
as some of the slowest hesitated at practically all of the words, 
and only 5% of the Poor group reached the median of the Good 
group. 

From this it appears that the test as it is, is a fairly good 
one for bringing out group differences, but not perfect enough 
to bring out fine individual differences. It could be easily im- 
proved so as to make it satisfactory for this purpose as well. It 
would be better to have the syllables arranged in chance order 
instead of as they were, so as to avoid such very easy sequences 
as ba-t, ca-t, rfo-te, ea-t, fa-t, ga-te, etc. The test could also 
be improved by eliminating some of the most difficult of the 
syllables. Two or three tests carefully graded in difficulty 
would be well worth trying out, but it seems to the writer that 
the most significant and reliable results would be obtained from 
the use of combinations that could be completed quickly and 
with little hesitation over single words. 

Test XIV, Ebbinghaus mutilated test, was very high in relia- 
bility as used, easily the highest of all. The coefficients or cor- 
relation between the two halves of the test are: 96 for the 
Good, 93 for the Poor, and 92 for all together. It is beyond 
question a good test, and should be perfected and standardized, 
with a considerable number of specimens graded in difficulty. 

Test XV, Absurdities, was discarded, as far as calculating 
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correlations with other tests was concerned, on account of im- 
perfections in the test itself, which made it difficult to find a 
satisfactory method of scoring the results. In general, sheet A, 
(see appendix, page 121), seems rather too easy to be of much 
value. 
No. I was missed by only i good subject, and by 5 of the poor 

group. 
No. 2 was wrongly marked by i good subject, and by 5 of the 

poor group. 
No. 3 was wrongly marked by i good subject, and by 6 of the 

poor group. 
No. 4 was omitted by i good subject, and by 3 of the poor 

group. 
No. 5 was omitted by 4 good subjects, and by 5 of the poor 

group. 
No. 6 was wrongly marked by o good subjects, and by 3 of the 

poor group. 
No. 7 was omitted by o good subjects, and by o of the poor 

group. 
No. 8 was wrongly marked by o good subjects, and by 3 of the 

poor group. 
No. 5 is of doubtful absurdity, and should be thrown out, 
while No. 7 should be thrown out on account of being too 
evident. 

In sheet B (see appendix, page 121). 
No. I was wrongly marked by 6 good subjects and by 3 of the 

poor group. 
No. 2 was omitted by i good subject, and by 8 of the poor 

group. 
No. 3 was omitted by o good subjects, and by 6 of the poor 

group. 
No. 4 was wrongly marked by o good subjects, and by 2 of the 

poor group. 
No. 5 was omitted by i good subject, and by 4 of the poor 

group. 
No. 6 was wrongly marked by i good subject, and by 3 of the 

poor group. 



50 Correlations of Mental Abilities 

No. 7 was wrongly marked by i good subject, and by 4 of the 

poor group. 
No. 8 was omitted by 8 good subjects, and by 8 of the poor 

group. 
No. I should be thrown out on account of general uncertainty 
as to its absurdity, and No. 8 on account of the ambiguity of 
the word " dummy.'' 

In sheet C (see appendix page .122), 
No. 1 was wrongly marked by i of the good group, and by 3 of 

the poor. 
No. 2 was omitted by 4 of the good group, and by 9 of the 

poor. 
No. 3 was omitted by 6 of the good group, and by 10 of the 

poor. 
No. 4 was wrongly marked by o of the good group, and by 6 of 

the poor. 
No. 5 was omitted by 4 of the good group, and by 14 of the 

poor. 
No. 6 was omitted by i of the good group, and by 8 of the 

poor. 
No. 7 was wrongly marked by 2 of the good group, and by 4 of 

the poor. 
No. 8 was wrongly marked by 15 of the good group, and by 7 

of the poor. 
No. 3, while missed by 6 of the Good group, can hardly be 
considered a poor test unless it be that the wording is somewhat 
faulty, for practically everyone knows that the cream rises to 
the top. In most cases the failures here were due simply to 
not applying the familiar information, and so failing to make 
the simple deduction. Some, however, found fault with the 
wording, and would not admit the absurdity when pointed out. 
If worded, "As everyone knows, a pint of cream weighs slightly 
more than a pint of milk," this objection would be overcome. 

No. 5 is sufficiently doubtful to be excluded. This is further 
substantiated by the fact that of 20 advanced students and pro- 
fessors who afterwards marked the sentences at their leisure, 
only 9 marked it as absurd, and i as doubtful. No. 6 would of 
course be faulty if the name Pontius Pilate were unfamiliar to 
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any of those tested. Perhaps John the Baptist, or Julius Caesar 
wpuld be preferable. No. 8 should be thrown out on account 
of ambiguity. 

In sheet D, (see appendix page 122), 
No. I was wrongly marked by o of the good group, and by 6 of 

the poor. 
No. 2 was omitted by 5 of the good group, and by 8 of the 

poor. 
No. 3 was omitted by 4 of the good group, and by 13 of the 

poor. 
No. 4 was wrongly marked by 8 of the good group, and by 8 of 

the poor. 
No. 5 was wrongly marked by o of the good group, and by 3 of 

the poor. 
No. 6 was omitted by 2 of the good group, and by 9 of the 

poor. 
No. 7 was omitted by 4 of the good group, and by 9 of the 

poor. 
No. 8 was wrongly marked by 7 of the good group, and by 12 of 

the poor. 
Some who missed No. 3 would not afterwards admit its ab- 
surdity on the ground that there is nothing absurd about the 
statement that some states have very absurd laws. While this 
objection does not seem to the writer to be very well taken, it 
would be better to find a re-wording of the point which would 
not be open to this objection. No. 7 should be thrown out as its 
absurdity is open to question. No. 8 however is not absurd, in 
spite of its being marked as such by 4 of the good group, and 
marked as doubtful by 3 others. It was not marked as absurd 
by any of the advanced students and professors who marked 
the absurdities at leisure. 

If the records are scored by the plan originally intended, in 
spite of the imperfections pointed out above, the group dif- 
ferences brought out by the test would be considerable. None 
of the Poor group reach the median of the Good group, 25% 
of the Poor group reach the lowest 4 of the Good group, and 
70% of the Poor reach the lowest i of the Good group. Of 
course this simple scoring does not take into account the time 
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element at all, and as the test was given, no scoring would be 
fair which did not take the time element into account. While 
the same instructions were given to all, the members of the 
Poor group took their leisure to a far greater extent than mem- 
bers of the Good group. There is no doubt that many of the 
Good subjects would have made fewer mistakes if they had 
taken as much time as members of the Poor group. It seems 
to the writer that the best way to give the test would be to allow 
each subject the same amount of time for marking each sen- 
tence. 

While the results could be scored by throwing out the objec- 
tionable sentences and taking into account the time element, it 
seems hardly worth while to do so without improving and ex- 
tending the test. Its reliability as given would probably be low. 

An improved test of this sort would surely reveal character- 
istic individual differences, and with a sufficiently large number 
of sentences the reliability of the test could be made satisfac- 
tory as long as it had not been previously seen by the subject. 
Just how significant this ability is in relation to general intelli- 
gence, is of course doubtful, but the correlation would probably 
be fairly high. Undoubtedly the most and the worst mistakes 
were made by the most stupid of the Poor group. 



TABLE VIII 
Extent to Which the Poor Group Overlaps the Good Grohp 
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VI. Significance of the Tests and Analysis of General 
Intelligence as Shown by the Differences Be- 
tween THE Good Group and the Poor Group. 

I. Extent of overlapping in the different tests 

Table VIII summarizes the group differences brought out by 
the tests. 

The Ebbinghaus and the Hard Opposites tests separate the 
two groups almost completely. The only exception is the case 
of one of the Poor group who surpassed two of the Good group 
in the Ebbinghaus test. The subject in question is the only one 
of the Poor group who held a responsible position at the time, 
viz., that of Assistant Superintendent of the Salvation Army 
Industrial Home. This shows very decidedly the fact that in 
the powers called into play in these two tests, the Good group is 
far superior to the Poor. 

The Memory tests separate the groups decidedly, but not so 
completely as the two tests just mentioned. None of the Poor 
group reached the median of the Good group in either Memory 
of Words or Memory of Passages. Only 10 per cent of the 
Poors reached the lowest 12 per cent of the Goods in Memory 
of Words, and only 15 per cent reached this standard in Memory 
of Passages. As one of the Good group did very poorly in 
Memory of Passages, 40 per cent of the Poor group surpassed 
his record. It is possible that reading the memory tests instead 
of allowing the subjects to read them, i.e., making the tests audi- 
tory rather than visual, may in some cases have put members of 
the Good group at a disadvantage. 

The Learning Pairs test separates the groups completely, ex- 
cept that 30 per cent of the Poor group surpass the record made 
by the lowest one of the Good group. This test is, without 
doubt, somewhat more novel to the Poor group than to the 
Good, on account of the fact that the Good group have studied 
languages and vocabularies. 

The Recognizing Forms test also separated the two groups 
completely, except that 10 per cent of the Poor group reached 
the lowest one of the Good group. It is to be remembered, how- 
ever, that this test was not satisfactory from the standpoint of 
reliability. 
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The Association test, Easy Opposites, and Adding, do not 
separate the two groups equally well. The Easy Opposites test 
does so completely, while in the Adding test lo per cent of the 
Poors reach the median ability of the Goods, 20 per cent of the 
Poors reach the lowest 24 per cent of the Goods, and 30 per 
cent of the Poors reach the lowest individual of the Goods. 

The fact that the estimated true correlation between Adding 
and Easy Opposites is only 56 (see Table XIV), suggests that 
it is hardly justifiable to class them both under the same name 
— Association — if this is to imply that they are tests of the 
same thing. The low correlation is suggestive of disturbing fac- 
tors — probably different amounts of practice in Addition by the 
different subjects. At any rate the one is a language test and 
the other is not. The Easy Opposites test also seems to re- 
quire selective thinking, responding to the elements of a situa- 
tion and thinking things together, in the case of the Poor group, 
to an extent that the Addition does not. 

The Completing Words or ba- test stands about midway be- 
tween the Easy Opposites test and the Addition test in respect 
to the way in which it separates the two groups. There are 
5 per cent of the Poor group who reach the median ability of 
the Good group, 10 per cent who reach the lowest 24 per cent 
of the Goods, 15 per cent who reach the lowest 12 per cent of 
the Goods, and 15 per cent who reach the lowest one of the 
Good group. 

The Perception tests, A test and Geometrical Forms, separate 
the two groups to about the same extent. In the A test 15 per 
cent of the Poors reach the median ability of the Goods, 20 
per cent of the Poors reach the lowest 24 per cent of the Goods 
in both tests, 25 per cent of the Poors reach the lowest 12 per 
cent of the Goods in both tests, while 35 per cent of the Poors 
reach the lowest one of the Goods in the Geometrical Forms 
test. 

The Scroll test will be seen by the table to divide the two 
groups to about the same degree as the Geometrical Forms test, 
though conclusions here are not to be much relied upon on ac- 
count of the low reliability of the test in the Good group. 
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The tests in Estimating Lengths and in Drawing Lengths 
show little difference between the two groups. In fact 55 per 
cent of the Poor group actually surpass the median ability of the 
Good group in Drawing Lengths, and 85 per cent of them are 
above the lowest one of the Good group. In Estimating Lengths 
the Good subjects are only slightly superior. 

Thus the tests reveal very marked differences in the two 
groups in language tests demanding selective thinking; marked 
but less difference in certain tests of memory; very decided dif- 
ferences in language tests demanding speed and accuracy in easy 
association; less difference in the more directly practiced and 
mechanical associations demanded in adding; in perception tests 
and in motor control the differences are somewhat less still ; and 
in discrimination of lengths they are least of all. 

That the differences brought out by the tests are not due merely 
to differences in training and education is demonstrable from 
facts which will be discussed in detail later. We believe it can 
be shown that by far the largest factor in causing these dif- 
ferences is the native capacity of the individual in question. 

2. Mental relationships revealed by Pearson coefficients of cor- 
relation 

The raw Pearson coefficients of correlation are given in Table 
IX. The top line gives the coefficient of correlation when all 
subjects are taken together, in one group of 37 persons. The 
second line gives the correlations secured when members of 
the Good group are taken separately, and the last line, the cor- 
relations when the Poor subjects are taken by themselves. 

It will be noted that on the whole the correlations are con- 
siderably higher in the top lines, i.e., where the two groups 
are taken together. Again the correlations of the second line, 
i.e., correlations of the Good group by itself, will be seen to 
be somewhat lower on the average than those of the third line 
where the Poor group is taken separately. The only clear ex- 
ceptions to this are in the Adding test, and in the unreliable tests. 
In case of the Adding test, it is doubtless due to practice enter- 
ing as a disturbing factor. These differences in the amount of 
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TABLE IX 



Pearson Coefficients of Correlation, Raw 
In the case of each test the heavy-face figure given first is for the Good 
and Poor together, divergences being measured from the median of the 37 
individuals. The second figure is for the Good group, divergences being 
measured from its median. The third figure is for the Poor group. 
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the correlation are of course due to the facts (i) that the 
groups represent different selections, and (2) that the points 
from which the deviations are measured are different. Thus 
(i) the correlation between height and weight would be greater 
if one took human beings of all ages than if one took only nine- 
teen-year-olds or only the new-born. Thus (2) the correlation 
between height and weight in the new-born would be greater if 
each baby's height and weight were treated as deviations from 
the average for all human beings of all ages, than it would be 
if each baby's height and weight were treated as deviations 
from the average of the new-born. Hence in our amalgamated 
group, as two extreme grades of ability are represented, and 
measured by the deviations from the approximate central ten- 
dency of all men, the correlations are high wherever the traits 
concerned are themselves correlated with general intelligence. 

Having in mind these facts that the size of the coefHcient of 
correlation is affected by how the group chosen is selected, and 
by what central tendency is taken from which to measure the 
deviations to be related, the question arises, " What would be 
the true correlations if instead of taking two small selected 
groups, we took a very large group representing a normal dis- 
tribution of human minds ? " This question can be discussed 
only after we have first compared the raw correlations in this 
table with the probable true coefficients which would have been 
got, if the original measures had each been a perfect measure 
of the average conditions of the trait in question, in the indi- 
vidual in question, instead of a measure secured by only a few 
minutes' sampling of his ability. These probable true coeffi- 
cients obtained by the formula, 

Rpq= 4 are given in Table X. 

V Rpip2 + Rqil2 

The coefficients of correlation to be used in correcting for 
attenuation the Pearson coefficients of correlation given in Table 
IX, are given in Tables XI, XII, and XIII. 

The deviation measures from which they are computed are 
given in Tables V a, V b, V c, VI a, VI b, and VI c. 



S8 



Correlations of Mental Abilities 



TABLE X 
Pearson Coefficients of Correlation (Corrected for Attenuation) 
In the case of each test the heavy-face figure given first is for the Good 
and Poor together, divergences being measured from the median of the 37 
individuals. The second figure is for the Good group, divergences being 
measured from its median. The third figure is for the Poor group. 
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Probable Error op the Pearson Coefficients of Correlation 







.6745 (1— r^) 






{Using the formula, P.E. 


\/n 






Group I, 


Group II, 


Group III, 




rv^37 


n=17 


n=20 


r. 


P.E. 


P.E. 


P.E. 


.10 


.11 


.16 


.15 


.20 


.11 


.16 


.14 


.30 


.10 


.15 


.14 


.40 


.09 


.14 


.13 


.50 


.08 


.12 


.11 


.60 


.07 


.10 


.10 


.65 


.06 


.09 


.09 


.70 


.06 


.08 


.07 


.75 


.05 


.07 


.06 


.80 


.04 


.06 


.05 


.85 


.03 


.05 


.04 


.90 


.02 


.03 


.03 


.95 


.01 


.02 


.02 



It will be observed in these tables that the correction for at- 
tenuation cannot be made in certain cases. Such are (i) cases 
(like the correlation between Memory of Passages and Geome- 
trical Forms for the Good group) where the average of the 
raw coefficients, -2i, comes out negative, though the facts for 
the Good and Poor together, and facts found by other workers 
in this field, prove that this inverse correlation is a chance re- 
sult from the small number of cases. Such are also (2) cases 
where either or both of the coefficients of correlation for the 
two trials of the same test are negative, as for instance in Recog- 
nizing Forms and Hard Opposites, the denominator would be 
V-41 X 60. Such negative correlations in the denominator of 
the correction formula are due either to a chance absurdity due 
in turn to the small number of cases, or to facts unrevealed in 
the measures themselves, which make the two trials with the 
same test not a random selection. 

What the true correlations would be if the subjects were 
representatives of people in general, can only be roughly esti- 
mated from the above data. They would of course be lower 
than those given in the top line of the table, and higher than 
those given in the second line, since a normal distribution of 
persons would vary less on the average than those of the amal- 
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gamated group, and more than the Good group. The true cor- 
relation would probably be considerably higher than that given 
by the Good group separately. If we take the average of the 
correlations given in the second and third lines of Table X, and 
then take the average of this result and that given in the top 
line, we shall probably make as close an approximation to the 
true correlation for people in general as could be arrived at 
from tliis data. Thus to get the estimated true correlation be- 
tween the Ebbinghaus test and Hard Opposites, take the average 
between 66 and 90, which is 78, and then take the average be- 
tween 78 and 92, which gives us 85 as the estimated true correla- 
tion for a group of normal distribution. The estimated true 
correlations for twelve of the tests of most satisfactory relia- 
bility are given in Table XIV. 



TABLE XIV 
Estimated True Cohrelation for People in General 

(That is, the probable correlations as they would be if the subjects were 
a very large number of persons representing a random sampling of all peo- 
ple, instead of two small selected groups. This table is compiled from Table 
X as explained above.) 
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For the remaining two tests, Recognizing Forms and Scroll, 
it is somewhat less safe to venture an assertion as to the prob- 
able true correlations with each of the other tests. As has 
been stated above, the ordinary methods of correction cannot 
be applied on account of the low reliability of the tests them- 
selves. The average of the correlations RpiQi, Rpi<l2> Rpili, 
RpiQi, that is, the numerator of the correction formula, gives 
us on the whole a result considerably less than the raw coeffi- 
cient, which is itself too small. I have calculated the probable 
true correlations between the Scroll test and the Recognizing 
Forms test, with each of the other tests, in the same way as in 
Table XIV, except that they are calculated on the basis of the 
raw coefficients instead of the correlated coefficients. The re- 
sults are: 
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They are probably at least lo per cent too low, but I leave 
them as they are, as they are uncertain at best with my data. It 
will be seen, however, that the Scroll test correlates much less 
highly with the other tests than does Recognizing Forms, in 
spite of the fact that on account of its greater reliability, the re- 
sults as given are less attenuated by chance errors. 

As is shown in Table XIV the order in which the different tests 
(exclusive of Recognizing Forms and Scroll) correlate with the 
other tests is : Hard Opposites, Ebbinghaus, Memory of Words, 
Easy Opposites, A test. Completing Words, Memory of Pas- 
sages, Adding, Learning Pairs, Geometrical Forms, Estimating 
Lengths, Drawing Lengths. 
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3. Grouping of the tests according to relationships shown by 
the correlation coefficients 

(i) Tests of selective thinking 
Table XIV, Estimated True Correlations, shows us that the 
correlation between the Ebbinghaus test and Hard Opposites is 
85. This high correlation suggests that we are not far wrong 
in classing these tests together as we have done in calling them 
both tests of selective thinking. In fact this correlation is al- 
most as high as the correlation between the first and second 
trials of each test. 

(2) Memory tests 

We see also that the correlation between Memory of Passages 
and Memory of Words is 80. On this basis we are justified in 
classing them together as memory tests, implying that they test 
in large measure the same ability. 

(3) Association tests 

It would appear, however, that we would not be justified in 
grouping Adding and Easy Opposites together as tests of asso- 
ciation in the sense of implying that they both are tests of the 
same thing. The correlation between them is only 56, not 
nearly as high as is the correlation between Easy Opposites and 
the tests of selective thinking, particularly Hard Opposites 
which is 83. The fact is that the Easy Opposites test is a test 
of selective thinking, especially for the Poor group. 

The Completing Words test is correlated more closely with 
Adding than with any other test, yy; the correlation between 
Easy Opposites and Completing Words is 62. 

Learning Pairs does not correlate closely with Adding, Easy 
Opposites or Ba and so cannot be classed with them as a test 
of the same kind of thing. 

(4) Perception tests 

On the other hand, the correlation between A test and 
Geometrical Forms is 87, being very much higher than the cor- 
relation of either test with any other test. Hence our justifica- 
tion in grouping them. 
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(5) Motor control 

The Scroll test has turned out to be quite unsatisfactory as 
to reliability in the case of the Good group. If we assume that 
it gives us satisfactory results in the case of the two groups com- 
bined, and in the case of the Poor group by itself, the co- 
efficients of reliability being 76 and 71 respectively, we could on 
this basis estimate the degree of its relationship to the other 
tests, and to General Intelligence. Just to what extent ability in 
this test is significant of other sorts of motor ability, we are of 
course unable to say. 

(6) Discrimination of lengths 

The estimated true correlation between Drawing Lengths and 
Estimating Lengths is only 26. In making this statement, how- 
ever, we should add that the figures are relatively untrust- 
worthy on account of the low reliability of the tests themselves, 
particularly Estimating Lengths, making the correction formula 
scarcely applicable. However, the very considerable differences 
between the correlation of random halves of the same test with 
each other, and the correlation of the two tests with each other, 
make it clear that there are very appreciably different factors 
involved in the two tests as given. 

4. Order in which abilities correlate with other abilities tested 
In order to determine which of the tests correlate most highly 
with other tests, we have simply to sum up the totals of the 
columns given in Table XIV. The average correlation of each 
test with the eleven other tests is : 

Hard Opposites 60, Ebbinghaus test 58, Memory of Words 
56, Easy Opposites 53, A test 50, Completing Words 47, 
Memory of Passages 44, Adding 43, Learning Pairs 41, 
Geometrical Forms 40, Estimating Lengths 26, Drawing Lines 
13. On the basis of our separate estimate, the figures for Recog- 
nizing Forms and Scroll would be 41 and 26 respectively. 
Grouping the related tests we have : 

Average correlation of Selective Thinking with other 

tests 59 

Average correlation of Memory with other tests 50 
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Average correlation of Association (exclusive of Learning 

Pairs) with other tests^ 48 

Average correlation of Perception with other tests 45 

Average correlation of Motor Control with other tests. . 26 
Average correlation of Discrimination of Lengths with 

other tests 19 

Provided only we have appropriately named the abilities 
above mentioned, all this means that, using the argument from 
correlation alone, power of selective thinking is more intimately 
connected with, and more characteristic of, general mental 
ability than is any of the other abilities tested; that memory is 
next most highly correlated with general ability; the simpler 
forms of association next; perception next; motor control con- 
siderable less; and discrimination of lengths least of all. This 
confirms the more direct argument stated above, from the ex- 
tent of overlapping of the Good and Poor groups in the different 
tests. 

5. Analysis of the individual differences revealed by the tests 
and evidence that they are largely due to differences 
in native mental capacity 

We are now in a better position to analyse the differences 
between the two groups that are revealed by the tests, and to 
indicate to what they are due. The most obvious difference 
between the two groups of persons is of course a difference in 
efficiency. That the members of the Poor group were relatively 
inefficient is shown by the nature of their previous employments, 
and by the fact that all but two of them were out of regular em- 
ployment. On the whole it is the more inefficient ones who are 
the first to be thrown out of employment. While an occasional 
one loses a good position owing to some bad habit such as in- 
temperance, dishonesty, etc., the many lose their positions owing 
to lack of initiative and zeal, stupidity, inability to make them- 
selves so useful that they cannot be easily replaced if dis- 
missed; in short, through general lack of efficiency. Moreover, 
not one of these persons, with a possible exception of No. 18, 
the brakeman who was then assistant manager of the Salvation 
Army Industrial Home, had ever held a position calling for 



' Including Learning Pairs, 46. 
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much exercise of intelligence, though they had reached years of 
maturity. The median age was 36, ranging from 23 to 52. As 
to the amount of schooling they had had, there was considerable 
divergence, the median number of years schooling being 7 and 
ranging from i to 12. With regard to those who had a small 
number of years' schooling it is safe to assume that in a rather 
large proportion of cases inability to get on well at school had 
a -good deal to do with it. On the basis of these facts it seems 
clear that the twenty members of the Poor group represent a 
grade of intelligence and mental capacity very far below the 
average. 

In the opinion of the writer, who spent from four to six 
hours with each of them and had an excellent opportunity to 
judge the characteristic methods of each in attacking mental 
difficulties, it is practically certain that at least fifteen of the 
twenty represent that quality of mind which ranges from some- 
what below the average in mental capacity to that which is only 
slightly above the feeble minded. They represent persons who, 
on the whole, are not at all bright, even if not positively dull 
in school; who have no particularly strong interests or am- 
bitions, and who are likely on leaving school to follow the line 
of least resistance in earning a living, doing most things they 
attempt with indifferent or poor success. They represent the 
kind of person relatively lacking in foresight, energy, self-re- 
liance, and all round mental capacity; the person likely to suc- 
ceed fairly well only if well directed and especially trained in 
some particular line of work, but who, under the most favorable 
conditions possible, is utterly incapable of attaining a degree of 
achievement much beyond mediocrity. At least six of these 
were persons considered dull by their acquaintances. As to the 
remainder of the group, in the opinion of the writer, they repre- 
sent persons ranging in mental capacity from that of the fifteen 
described above to something above the average in the case of at 
least one or two. 

The Good group, on the other hand, is composed of persons 
who are far above the average in mental capacity. It is safe 
to say that persons who become college professors or instructors 
before the age of thirty-five possess mental ability ranging from 
considerably above the average toward the very highest mental 



SigW^cance of Tests and Analysis of General Intelligence 69 

capacity. That the members of the Good group are representa- 
tive of mental abiHty far above the average is evidenced by the 
nature of the positions they now hold. Eight or nine of the 
seventeen at present occupy college or university positions. The 
rest hold educational positions, mostly in normal schools of high 
rank, with the exception of one, who holds an important and 
responsible position in a philanthropic organization. There can 
be no reasonable doubt, therefore, that the two groups are repre- 
sentative of very decided contrasts of mental ability. 

One may say that the qualities making for success are in 
large part moral — capacity for self-control, self-denial, indus- 
try, conscientiousness, etc. Some of these qualities or tendencies 
are such as to be indicated to a close observer, in the way in 
which the mental tests were attacked. Analysis would un- 
doubtedly show positive correlation between mental and moral 
qualities. In addition, one of the most striking differences be- 
tween the two groups was a decided contrast in attitude with 
regard to time. All of the Good group did the tests with a 
high regard for the value of time, while such a thing as the 
time, element being of much significance or importance hardly 
seemed to dawn upon members of the Poor group, even though 
the instructions given were the same for both groups. 

Having considered group differences on the basis of general 
principles, let us now consider the differences in the two 
groups as revealed by the tests. As above stated, page 53, the 
most decided difference between the groups is in the tests of 
selective thinking — Ebbinghaus test and Hard Opposites. This 
is exactly what the results obtained in investigations on the men- 
tality of the feeble minded would lead us to expect. All of 
those who have worked with the feeble minded agree that they 
are farthest removed from the normal in ability to deal with the 
abstract, ability to get and use meaning and significance. It is 
also demonstrated that the most painstaking educational efforts 
to improve their capacities in this respect are relatively ineffec- 
tive.^ The same would undoubtedly hold true, though to a less 
degree, of persons somewhat above the mentally defective. 

It may perhaps be objected that the two tests in question are 
simply language tests, and, as such, measures of amount of train- 

' See Goddard, Journal of Educational Psychology, November, 1911. 
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ing and education rather than measures of mental capacity. 
That this is not the case is evidenced by the low correlation be- 
tween number of years schooling and rank in the tests. The 
correlation between number of years schooling and rank in the 
eight tests correlating most highly with other tests is 38. This 
amount of positive correlation could be accounted for on the 
basis of the correlation between mental ability and staying on 
at school. Studies in retardation have shown that a considerable 
proportion of those who leave school at an early age, do so be- 
cause they are relatively unable to do the mental tasks required 
of them in school. In other words, in a considerable propor- 
tion of cases, a small number of years schooling means inability 
to learn advanced and difficult language work, rather than 
lack of opportunity to learn it. There is no way of calculating 
directly the correlation between general intelligence in the Poor 
group, and rank in the tests of selective thinking, as we have 
no way of independently ranking them in general intelligence. 
In the Good group, however, the correlation between estimated 
intelligence and rank in the tests of selective thinking is 92.5. 
In the Poor group it would probably be higher still as in general 
all other correlations are. Hence the tests of selective thinking 
do not measure mere training and schooling. 

This was further evidenced by a consideration of individual 
cases. In general those who were considered decidedly dull or 
stupid by their fellows, did poorest in the tests of selective 
thinking. 

Moreover, language tests similar to those here given are not 
unfair tests of ability as opposed to mere education, since the 
intelligence of a primitive people can be gauged by the language 
they find it necessary to evolve and use. Feeble-minded chil- 
dren are, on the whole, decidedly deficient in acquiring higher 
forms of language, while bright children with similar educational 
advantages, acquire language naturally and easily. It is the same 
with many of the Poor group. Their low grade of native 
mental ability has made them very slow to acquire average 
facility in the command of the higher form of language, and 
very difficult for them to acquire and make use of abstractions, 
fine shades of distinction in the meaning of words, etc. These 
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differences in native capacity come out in any classroom of an 
elementary school. 

In a certain sense all ability is dependent upon training and 
practice inasmuch as one cannot long retain a capacity unless 
it is more or less exercised. The skill of the talented musician 
could not be achieved without attention given to music. But 
a true way of viewing the difference between a musical genius 
and a person of average musical ability is just this: The in- 
terest and inborn capacity of the one is so great that it leads him 
to get abundance of practice and training in the capacity, where- 
as the weaker interest and inborn capacity of the other leads 
him to let the practice of other abilities and capacities predomi- 
nate. So with interest in, and ability to deal with the abstract. 
Gifted minds early become interested in mental exercises of a 
kind for which the dullard has no interest. Interest in fairy 
tales, romance, fiction, literature, science and philosophy, in so 
far as these involve the use of concepts, would lead to the ex- 
ercise and development of the higher forms of association and 
abstraction. In short, we may well look to language tests in 
some form, to furnish good tests of general intelligence. 

Moreover, the high correlations within the Good group itself, 
where there has been presumably no appreciable inequahty on 
the basis of opportunity for education, further substantiate the 
view that by far the most influential factor in making for effi- 
ciency in these tests, is the native capacity of the individual in 
question, and not simply his training and environment. The 
results in memory and association show nearly as much dif- 
ference between the two groups as do the tests of selective 
thinking, and the same general line of argument would hold 
with regard to them. 

To test still further the extent to which ability in certain of 
the tests is significant of general intelligence as commonly under- 
stood, there was taken for each individual a combined measure 
obtained by summing up his score in the Ebbinghaus test, Hard 
Opposites, Easy Opposites, Learning Pairs, and Recognizing 
Forms, arranged in such a way as to allow approximately equal 
weight to each test. The scores of the different subjects, in 
terms of the deviation from the median of all are: 
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Subject 


Deviation 


Subject 


Deviation 


1 


89 


18 





2 


61 


19 


—2 


3 


95 


20 


—15 


4 


49 


21 


—26 


5 


70 


22 


—9 


6 


43. 


23 


—26 


7 


39 


24 


—27 


8 


54 


25 


—31 


9 


59 


26 


—25 


10 


21 


27 


—13 


11 


53 


28 


—16 


12 


44 


29 


—32 


13 


42 


30 


—59 


14 


25 


31 


—45 


15 


49 


32 


—51 


16 


47 


33 


—119 


17 


21 


34 


—54 






35 


—59 






36 


—73 






37 


—127 



There is no one of the Poor group who reaches the lowest 
one of the Good group; in fact there is a decided gap of 21 
here. There is a much larger gap between the median of the 
Good group and the highest one of the Poor group, namely, 49. 
Thus we see how effective such a combined score is in separat- 
ing the Good group from the Poor. 

TABLE XV 
Ranks of Good Group for Imputed Intelligence 
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12 


10 
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13 


10 


11 


12 
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14 
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12 
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13 
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14 
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16 
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To further test the extent to which such a combined score is 
a measure of ' general intelligence,' the individuals of the Good 
group were rated in order of merit for general intelligence each 
by the rest of the group, so far as was possible, four years after 
the tests were taken. The rankings, including two rankings by 
the experimenter, made a month apart were as shown in 
Table XV. 

Taking the individuals who were ranked by ten or more 
persons, and using for each two random halves of his ranks, and 
all together, we have the following: 
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100% 
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100% 
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66.6 


58 





.64 


.30 
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' 2 " 


50 


82 


66.6 





1.36 


.64 
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5 " 


66.6 


66.6 


66.6 


.64 


.64 


.64 


5 


8 " 


66.6 


80 


73 


.64 


1.25 


.91 


8 


13 " 


66.6 


75 


73 


.64 


1.00 


.93 


13 


14 " 


66.6 


66.6 


66.6 


.64 


.64 


.64 


14 


' 9 " 


50 


83.3 


66.6 





1.43 


.64 


9 


12 " 


40 


83.3 


64 


— .38 


1.43 


.53 


12 


10 " 


100 


80 


89 


3.5 


1.25 


1,82 


10 


17 " 


100 


80 


89 


3.5 


1.25 


1.82 



' Inferring from the percentages the distances between the individuals 
in question in terms of the P.E. by means of the following table quoted from 
page 16 of "The Perception of Small Differences" by Fullerton and Cattell. 
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P.E. 
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P.E. 
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.00 


60 


.38 


70 


.78 


80 


1.25 


90 


1.90 


51 


.04 


61 


.41 


71 


.82 


81 


1.30 


91 


1.99 


52 


.07 


62 


.45 


72 


.86 


82 


1.36 


92 


2.08 


53 


.11 


63 


.49 


73 


.91 


83 


1.41 


93 


2.19 


54 


.15 


64 


.53 


74 


.95 


84 


1.47 


94 


2.31 


55 


.19 


65 


.57 


75 


1.00 


85 


1.54 


95 


2.44 


56 


.22 


66 


.61 


76 


1.05 


86 


1.60 


96 


2.60 


57 


.26 


67 


.65 


77 


1.10 


87 


1.67 


97 


2.79 


58 


.30 


68 


.69 


78 


1.14 


88 


1.74 


98 


3.05 


59 


.34 


69 


.74 


79 


1.20 


89 


1.82 


99 


3.45 
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Calling the degree of ability midway between subject No. 8 
and subject No. 13 the central tendency, we get Table XVI. (A 
case of 100 per cent is treated as equivalent to 3.5 P. E. because 
there is a high probability that with many judges the lOo's in 
these results would fall to 99 or even much lower.) 

TABLE XVI 

Deviation Measures Inferred from Per Cents of Judgments 
OF Superior 





Deviations 


Deviations 


Deviations 


No. of 


from Median, 


from Median, 


from Median, 


Subject 


1st half 


2nd half 


entire set 


1 


5,1 


7.9 


6.5 


4 


.60 


4.39 


2.94 


3 


1.60 


3.75 


2.64 


2 


1.60 


2.39 


2.00 


5 


.96 


1.75 


1.36 


8 


.32 


.50 


.45 


13 


— .32 


— .50 


—.46 


14 


—.96 


—1.14 


-1.10 


9 


— .96 


—2.57 


—1.74 


12 


— .58 


—4.00 


—2.27 


10 


—4.00 


—5.25 


—4.09 


17 


-7.5 


—6.50 


—5.91 



Using this deviation table, the coefficient of correlation cor- 
rected for attenuation was calculated between ' general intelli- 
gence ' as judged by one's fellows, and the record secured in the 
five combined tests. This correlation is 92. 

In order to test the reliability of this combined score, the co- 
efficient of correlation was calculated between the combined 
score for the first trials, and the combined score for the second 
trials. For the Good and Poor subjects together the raw co- 
efficient is 96; for the Good group (12 subjects only), 72; and 
for the Poor, 90. This high reliability of the combined score 
proves that fifty minutes or so spent in getting such a record, 
gives us a good measure of whatever it is that this combined 
score measures. That this, in turn, gives us a very significant 
indication of the general mental ability of the individual in ques- 
tion, is shown by the almost perfect correlation between score- 
in-the-combined-tests and estimated intelligence. 

Again the Pearson coefficients of correlation corrected for at- 
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tenuation were calculated between Estimated Intelligence and 
each of the eleven most reliable tests. The coefficients are : 

Estimated Intelligence and Hard Opposites 96 

Ebbinghaus test 89 

Memory of Words 93 

Memory of Passages 35 

Easy Opposites 82 

Adding 72 

Learning Pairs 34 

Completing Words (?) 100' 

A test 21 

Geometrical Forms 07 

Drawing Lengths —20 

(These coefficients are calculated of course on the basis of only twelve of 
the Good subjects.) 

Here again is indicated how reliable as tests of intelligence 
are the Ebbinghaus test, Hard Opposites and Easy Opposites. 
The order in which the diflferent capacities correlate with gen- 
eral intelligence is practically the same as shown in Table XIV, 
and further substantiates the results there stated. 



VII. Comparison of Results With Those of Other In- 
vestigators 

One of the earliest attempts to give an exact quantitative 
statement of mental relationships was that of Wissler ('01). 
The correlations obtained by Wissler, however, are on the whole 
much too low, owing largely to the fact that about twenty dif- 
ferent capacities were tested in less than fifty minutes, making 
the measure of each person's capacity in each trait altogether in- 
adequate. It is impossible on the basis of Wissler's data to es- 
timate how much the correlations he found would be raised by 
correction for attenuation due to inaccuracies in the measures 
themselves. 

In the case of the correlations obtained by Aikens and Thorn- 
dike ('03), also, the absence of data makes it still impossible to 
estimate how much they are influenced by attenuation. Impor- 
tant later studies demand detailed consideration. 

Norsworthy ('06) compared mentally defective children with 
normal ones by the use of tests similar to those here used. The 
mental capacities tested were: i, Ability to form abstract ideas 

'Not sufficiently reliable; as the reliability coefficients of the completing 
words test in the Good group is only 27. 
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(recognition of nouns) ; 2, Ability in appreciating relationships 
and in controlled association (part-whole test, genus-species, 
easy opposites) ; 3, Memory (words and sentences) ; 4, Ac- 
curacy and speed in Perception (A test and a-t test) ; 5, Percep- 
tion of weight (accuracy in judging relative weights) ; 6, Motor 
control (maze and form board). Numbers i and 2 are com- 
parable with our selective thinking and association tests, number 
S is probably somewhat akin to accuracy in judging lengths, and 
the maze test in number 6 is very similar to our scroll test. 

Miss Norsworthy found that the defectives were farthest 
removed from normal children in ability to deal with abstract 
data. This is shown in the following table, compiled from re- 
cords of about 137 cases, ranging in age from eight years up : 



% above 

median 

for 

ordinary 
children 



% above 
— 1 P.E. (or 
lowest 25% 
of ordinary- 
children) 



% above 
—2 P.E. (or 
lowest 9% 
of ordinary 

children) 



1. Height 

2. Weight 

3. Pulse 

4. Temperature 

5. Weight test 

6. A test 

7. a-t test 

8. Memory of unrelated words . . 

9. Composite of 5, 6, 7 and 8 . . . 

10. Dictation 

11. Memory of unrelated words. . 

12. Part-Whole test 

13. Genus-Species test 

14. First Opposite test 

15. Second Opposite test 

16. Composite of 13, 14, 15 and 16 



45 

44 

49 

26 

18 

9 

1 

6 

1 

10 



61 
66 
69 
59 
28 
18 
14 
18 
15 
10 
19 
17 
16 

0.9 

1 

1 



77 
77 
86 
77 
39 
34 
28 
27 
27 
21 
30 
27 
17 
5 
7 
10 



According to her results, the order in which the different 
abilities tested would correlate with intelligence is roughly as 
follows: abstraction and association; memory; various forms 
of perception; motor control. This agrees with our results in 
so far as they can be definitely compared. Miss Norsworthy's 
tests of abstraction and association would seem to be very 
similar, and to require almost equal powers of abstraction; 
hence the somewhat higher rank relatively of association as com- 
pared with our results. Abstraction is of course a relative term. 
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What would be severe trials of abstraction to a feeble minded 
person, or to an immature mind, might be mere tests of rapidity 
and ease of association and memory to an abler or more mature 
person. As we stated elsewhere, the Easy Opposites test to 
many of the Poor group, was like the Hard Opposites test for 
the Good group. 

Lewis M. Terman ('06) tested seven of the brightest and 
seven of the dullest boys of a group of five hundred elementary 
school pupils. The capacities tested were: 
Powers of Invention and Imagination as tested by ability to 

solve mental puzzles. 
Logical Processes as tested by solution of problems in arith- 
metic and other problems requiring original thinking. 
Mathematical Ability as tested by problems requiring the 

more mechanical phases of arithmetic. 
Mastery of Language as shown in word building, reading, 
Ebbinghaus Mutilated Text, spelling and facility in inter- 
preting commands. 
Insight as shown in the interpretation of fables. 
Ease of Learning the game of chess. 
Memory of geometrical figures, chess moves, a story read, and 

the solution of a mechanical puzzle. 
Motor Ability, as shown in learning visual-motor coordina- 
tions, skill in running down stairs, and in carrying a book 
on the head. 
Terman concludes that the bright boys are superior to the dull 
in all the mental tests, and inferior in the motor. This su- 
periority of the dull group in motor tests, if characteristic of 
persons in general, would of course mean a negative correlation 
between motor ability and " intelligence," and be in conflict with 
our conclusion of a small but positive correlation. It is cer- 
tain, however, that at least a part, and probably all, of this 
superiority of the dull group in motor ability was due to greater 
maturity on account of their being on the average 12.7 months 
older than the bright group. Again, it may be, in part, that 
boys in school who have some motor ability and interest in that 
kind of thing, are on that account more likely than other boys 
to give little attention to linguistic and abstract studies, and for 
that reason are likely to be considered less intelligent by their 
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teachers than they really are, and duller than boys who lack 
such interests altogether. The fact that the dull boys all pre- 
ferred games and the bright ones reading also suggests this con- 
clusion. On general principles it seems hardly likely that there 
should actually be a negative correlation between motor ability 
and intelligence, at least as far as native capacity is concerned. 
It would seem quite probable, too, that the groups here selected 
would not represent bright and dull children respectively as 
judged from a more general point of view than that of a school 
teacher. 

I have calculated the Pearson coefficient of correlation for 
each test with every other test, using Terman's scores for each 
of the fourteen boys taken all together in one group. In order 
to bring out the relationships between the different tests as fully 
and accurately as possible, I have taken the actual scores made 
instead of the ranks. The only exception to this was in the case 
of the ' Interpretation of Fables,' where on account of the omis- 
sions it was not very practicable to take the scores. The co- 
efficients are given in the table below. 
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66 


— 36 


Chess 


81 
86 
75 


82 
77 


82 
73 


77 
73 


65 
74 
62 


78 
64 

.■;8 


62 
70 
63 


— 23 


Mathematics 


— 48 


Logical Processes 


—22 




78 


65 


74 


62 




58 


72 


— 26 


Invention 


7?! 


78 


64 


58 


58 




36 


— 14 


Fables (interpretation of) . 


66 


62 


70 


63 


72 


36 




—52 


Motor Ability 


—86 


—23 


—48 


—22 


—26 


—14 


—52 










422 


422 


401 


386 


383 


350 


317 


—221 



The table gives of course the raw coefficients, and correction 
for attenuation would raise them somewhat. 
On the other hand these fourteen boys, like our thirty-seven 
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men, represent a selection half from near the top and half from 
near the bottom of the scale of general intelligence. In so far 
forth the correlations are too high. They should be thought of 
much as the raw correlations for the Good and Poor together 
in our own experiment. The high intercorrelation (with the ex- 
ception of the motor tests where the negative correlation in evi- 
dence has already been accounted for) are in harmony with our 
results. 

W. C. Bagley ('01) reported a negative correlation between 
motor skill and intellect. Professor E. L. Thomdike^ has shown 
that this result was due to an arithmetical error, and to over- 
sight of the influence of the age factor. When children of 
nearly the same age are taken, Bagley's data give a slight posi- 
tive correlation between mental and motor ability. 

Binet ('99) has attempted to differentiate intelligent pupils 
from unintelligent by the use of tests of voluntary attention. 
He considers that the tests he used are not, properly speaking, 
tests of comprehension, but depend upon processes relatively 
more simple, — especially acts of memory and comparison. He 
tested eleven subjects, five intelligent and six unintelligent. The 
tests included accuracy in tactile sensibility; quickness of re- 
action time; speed and accuracy in counting dots; perception of 
change in the rate of speed of rhythmic movement ; counting of 
rhythmic sounds; copying figures, different varieties of sen- 
tences and drawings; rapidity of perception of words and 
figures seen for a fraction of a second; accuracy and speed in 
picking out one or more letters from a printed page (our A test 
and an extension of it) ; and simultaneous adding. 

Binet found that a number of the tests did establish a clear 
differentiation between the bright and the dull pupils, and that 
they are therefore promising as tests. Those which turned out 
to be the best in this respect he found to be the tests of accuracy 
in tactile sensibility, counting rhythmic sounds, copying figures, 
sentences and drawings, memory of figures, and cancellation of 
letters. He found also that this difference is most marked at 
the first trials, but diminishes with subsequent trials, and that 
in some cases it may be effaced. On account of the small 
number of Binet's subjects, and on account of the fact that, sev- 

' Educational Psychology, 1903 edition, pp. 148, 149. 
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eral of the tests being new, the subjects' records are hard to 
evaluate with accuracy, it is unsafe to make very specific com- 
parisons. However, subject to these limitations, the group dif- 
ferences brought out by the tests are shown in the two following 
tables. Table XVIII gives a comparison of the average per- 
formance of the bright group with that of the dull, on the basis 



TABLE XVII 



Bright 



Dull 



Tactile perception, errors with points 2 cm. apart... . 

Counting of dots, errors in score 

Counting of sounds, errors in score 

Copying figures, average number 

Memory; numbers forgotten in 5 numbers of 5 figures 
Reading through an opening, errors (Rapidity of 

Apperception) 

Simultaneous adding, errors 

Cancellation of letters, errors 

Simple reaction time 



20% 


64% 


3.4% 


5% 


4.3% 


20% 


3.6 


2.8 


12 


42 


34 


34.5 


19% 


23% 


9% 


27.5% 


24 


24% 



TABLE XVIII 
R.4.NKIN6 OP Pupils in the Different Tests 
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of the first trial of the test only, in order to rule out the factor 
of mental adaptation through practice. 

Binet lays great stress upon the conclusion that the difference 
between the intelligent and the unintelligent consists in quick- 
ness of adaptation. The bright pupils adapt themselves more 
quickly than the dull; the dull adapt themselves in turn, but 
more slowly. In so far as this were true, other things being 
equal it would imply, with reference to correlations between 
different tests, that they should be relatively high with unprac- 
ticed subjects, and would gradually diminish as the subjects had 
more and more practice. 

Spearman and Krueger in working over Oehrn's results from 
ten subjects, including five medical doctors and three medical 
students, found that up to a certain point correlations increase 
in size as practice increases, and that after that point they again 
tend to diminish. The decrease in this case may be due to 
fatigue entering as a disturbing factor as the test is prolonged. 
The change in the size of the correlations as practice proceeds 
is shown in the following table. 



TABLE XIX 

Uncorrected Coefficients of Correlation for Each Successive 
Quarter Hour, According to the Results of Oehrn 



Quarter Hour 



Writing and Adding 

Writing and Counting 

Writing and Reading 

Writing and Learning by Heart .... 

Adding and Counting 

Adding and Reading 

Adding and Learning by Heart . . . . 

Counting and Reading 

Counting and Learning by Heart. . . 
Reading and Learning by Heart 



1st 2nd 3rd 4th 5th 6th 7th 8th 



50 68 72 65 64 68 55 68 
58 67 70 75 81 71 54 58 
32 42 51 53 48 38 42 47 
10—02—03—03 03 02 25—08 
37 56 69 67 64 59 50 31 
01 14 24 18 05—18 22 26 
22 24—09—02—26 00—07—13 

—17—16—04 05 14—10—26—21 
•24 —22 —27 —23 —15 —02 —16 —21 

—05 07 08—10 19 05 03—10 



That increase or decrease would depend upon the kind of test 
and the stage of the subjects in the learning process, and be sub- 
ject further to such disturbing factors as fatigue, change in zeal, 
etc., seems quite evident. In concluding. Spearman and Krueger 
say that the lack of being accustomed to the test instead of act- 
ing as a cause of correlation, would act as a disturbing factor 
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lessening it. Which effect it would have, however, would surely 
vary according to circumstances. If conditions were such that 
the more intelligent could in the time allowed acquire a method 
that they could use to advantage, while the poorer ones did not 
get to that point, the correlations would be correspondingly high. 
If, on the other hand, conditions were such that even the best 
of the subjects were unable in the time allowed to hit upon 
and use effectively any intelligent method, their intelligence 
would count for little or nothing in determining the records, and 
the correlations would be correspondingly low or zero. 

Difference in quickness of adaptation is certainly in evidence 
in Binet's results, one illustration of which is given in the fol- 
lowing table: 





NuMBEK or 
Letters Marked 


Number of 
Errors 




Bright 


Dull 


Bright 


Dull 


1st period of 5 minutes 

2nd " "5 " 

3rd " "5 " 

4th " "5 " 

5th " "5 " 


115 
147 
176 
183 
213 


124 
128 
179 
171 
207 


13.5 
10 

9 

3.5 

6 


43 
32 
15 

8 
7.5 



The improvement of the dull pupils in this cancellation test 
shows itself particularly with respect to the quality of the work. 
But difference in quickness of adaptation would seem a priori 
to be a characteristic applying to the learning of something of 
which both groups are capable, rather than to the factor of 
fundamental importance, differentiating the intelligent from the 
unintelligent. Of course a lower type, of mind does adapt itself 
more slowly than a higher type of mind, but a more funda- 
mental difference between the inferior type of mind and the su- 
perior type, exists in the fact that there are certain kinds of 
performance to which the inferior mind is relatively incapable 
of adapting itself at all, such as the higher forms of abstraction. 
The tests here used by Binet are not such as to bring out this 
difference with any degree of clearness. They are largely along 
the line of perception and memory. The nearest approach to 
testing facility in the higher mental processes akin to abstrac- 
tion is in connection with the copying tests. The fact that there 
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is so decided a difference between the two groups with regard 
to the logical grouping of phrases in copying the easy and the 
difficult prose passages, suggests a difference which does not 
belong primarily under the category of quicker or slower 
adaptation. 

The method of this investigation and the tests used are sug- 
gestive, and worthy of leading to a more extended investigation 
along this line; and it would surely be worth while for some- 
one to take those of the tests found to be most satisfactory, 
and see what their reliability is with a larger number of subjects. 
Then if correlations between the different tests were calculated, 
more definiteness would be given to the work already begun. 
However, what we need most of all is the devising of tests of 
different forms of the highest mental processes — ^performances 
such that the duller minds cannot learn them so readily as they 
do feats of perception. 

In copying the easy material the bright pupils copied 4.5 
words at a time and the dull pupils 3.4; in copying the dif- 
ficult material the bright copied on the average 2.8 words and 
the dull 2.4. The grouping of phrases was far more logical with 
the bright than with the dull. On the other hand, in the copy- 
ing of nonsense material the dull pupils did fully as well as the 
bright. The difference here would seem to be one of appercep- 
tion. When it came to copying nonsense material both groups 
were equally at a loss, as neither had at hand any means of or- 
ganizing the material. Doubtless continued practice at this 
would have revealed a quicker adaptation, and an earlier rise 
in the curve of learning in the case of the bright pupils than in 
the case of the dull, owing to their being quicker to find a 
method of organizing the new material. However, we could not 
hope to arrive at the true correlations of an ability with general 
intelligence if we took our individual records at a point in the 
learning curve where the dull group were still practically at 
zero efficiency, while the bright group had just made a sudden 
rise. Any such decided unevenness in the learning curve, un- 
less eliminated by averages, would seriously interfere with the 
securing of the true correlation between the characteristic 
ability of the group in that capacity, and general intelligence. 

Binet finds that in tactile sensibility the intelligent are de- 
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cidedly superior in the first trials to the unintelligent. It is 
questionable here whether quickness in comprehending the in- 
structions was not a prominent factor in the results secured. 
When the dull can, in so small a number of practice trials, al- 
most equal the bright, who have meanwhile had an equal number 
of practice trials, it seems evident that the ultimate and real 
difference between the two groups in this capacity is not a large 
one. The difference with respect to speed of adaptation is a 
general factor that would tend to cover up or disguise the real 
correlations between different capacities themselves, and not aid 
materially in the detailed diagnosis of intelligence. It is sug- 
gestive, too, that in some of the tests the dull pupils did as well 
as the bright. May this not mean simply that the correlations 
between this sort of ability and general intelligence are too small 
to be brought out by such crude measurements and treatment of 
them as Binet employed ? 

Hence to speak of the difference between inferior and su- 
perior minds as fundamentally a difference in power of volun- 
tary attention, seems misleading in that it is too general a state- 
ment. It suggests that voluntary attention is a capacity that 
can be applied equally well in any desired direction, to the ex- 
tent of one's general ability. It errs in implying that there are 
not various kinds of volvtntary attention, that do not correlate 
perfectly with one another. On the contrary, there are many 
varieties of voluntary attention — if we are to express the facts 
in Binet's terminology — for one might give a degree of atten- 
tion to music that he could by no means give to mathematics 
or to painting. Difference in quickness of adaptation there cer- 
tainly is, and this difference is one of no little significance; but it 
is one of degree. A more significant difference between inferior 
and superior minds is one so pronounced as to suggest a dif- 
ference in kind, rather than one in degree merely. 

By this we mean that there are certain kinds of mental 
feats that can be performed by the able mind that can scarcely 
be performed by the inferior grade of mind at all, let its pos- 
sessor practice at it as much as he will. This sort of distinc- 
tion is not suggested by Binet's way of stating it — that the de- 
termining factor is a matter of voluntary attention. 
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Spearman and Krueger ('06) published the results of an in- 
vestigation on the correlations existing among the following 
mental abilities: Touch Discrimination, Tone Discrimination, 
Adding, Learning by Heart (a series of numbers), Ebbinghaus 
Mutilated Text. Their subjects were eleven advanced univer- 
sity students, four of whom did not speak German as their 
mother tongue. 

As to correlation between the two trials of the same test, 
Spearman and Krueger's coefficient for Adding is 76; ours are 
91 for the combined groups, 76 for the Good, and 90 for the 
Poor. This agreement is close. In the Ebbinghaus test their 
coefficient of reliability is 76 ; ours are 92, 96, and 90 — consider- 
ably higher doubtless on account of the greater number of trials 
and the method of scoring. In Sensory Discrimination their re- 
sults on Touch Discrimination and Tone Discrimination may be 
compared with ours on Drawing Lines. For Touch Discrimina- 
tion their coefficient is low, namely 42. For Tone Discrimina- 
tion their coefficient is 87, as compared with ours for drawing 
lines 72, 42, and 95. Their coefficient for Learning by Heart, 
92, is a good deal higher than ours for Unrelated Words, 73, 
48, 49; and somewhat higher than ours for Memory of Pas- 
sages, 90, 78, 83. 

It is well that our attention has been called to the fact that 
very different results may be obtained by two different experi- 
menters using the same tests. However, the fact that there is 
such close correspondence as to reliability coefficients, where 
they are at all comparable, would go to show that when the test 
is itself satisfactory, and where the experimenter understands 
his business, there is no necessity that the investigations be con- 
ducted by two different experimenters. This precaution applies 
rather to the use of a new and untried test, not to one whose 
method of procedure is clearly understood, provided it is other- 
wise reliable as a test. Once the conscientious investigator is 
aware of the possibility of inaccuracy from such a source, he 
should be able, with our present knowledge of former errors 
in the conduct of mental tests, to secure by himself results that 
will not be vitiated by the personal equation. 
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The Pearson coefficients of correlation which they obtained 
among the abiHties tested are as follows : 



Learn- 
ing by 
Heart 



Raw Coefficients: 

Adding 

Ebbinghaus test 

Tone Discrimination . . , 
Touch Discrimination . . 
Learning by Heart 

COEEECTED COEFFICIENTS 

Adding 

Ebbinghaus test 

Tone Discrimination . . . 
Touch Discrimination . . 
Learning by Heart 
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Thus between Adding and Ebbinghaus test they get a correla- 
tion of 79 raw, and 70 corrected; while ours are 71, 61, 65 
raw, and 71, 55, 63 corrected. It is interesting to note that with 
their results as with ours, the coefficient of correlation becomes 
lessened when corrected. One would expect their coefficient of 
correlation to correspond approximately with ours for the Good 
group. It may be, however, that their group represents persons 
of considerably greater variability than our good group, in 
which case the correlation would be higher. In one respect at 
least their results would be much less reliable than ours, namely, 
in the number of subjects tested. They used eleven subjects 
for the Adding test and only seven for the Ebbinghaus test, the 
foreigners being excluded. Then, too, it may be that as the sub- 
jects were not all of the same nationality, the correlation of 
Adding with other capacities may be largely influenced by dif- 
ferences of practice in adding, characteristics of the different 
countries represented. 

Spearman and Krueger did not give the corrected coefficients 
for Learning by Heart and the other four abilities tested. They 
put them down as zero because they are less than five times the 
P. E. Their raw coefficients are on the whole much lower than 
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the correlations we find between Memory of Words and Adding, 
Ebbinghaus test, and Discrimination of Lengths, as will be seen 
in the following : 

Raw Pearson Coefpicients — According to Our Results 



Combined 


Goods 


49 


21 


94 


54 


21 


—01 


28 


19 



Poors 



Memory of Words and Adding 

" " " Ebbinghaus test 

" " " Drawing Lengths 

" " " Estimating Lengths. . 



22 
66 

05 
04 



Spearman and Krueger conclude that there are beyond doubt 
positive and fairly high correlations between abilities as varied 
as Discrimination of Tones, Adding, and the Ebbinghaus test; 
and on the basis of Oehm's data, between Speed of Reading, of 
Writing, and of Counting; and that the size of these different 
correlations is to be explained on the basis of the degree of 
their connection with a hypothetical, common central factor. 
They think there is good reason for believing that the central 
factor is not to be explained by individual differences in the 
zeal of the subjects, their momentary disposition, their being 
accustomed to the conditions of the experiments, their ability to 
make the most of help given, or to the power of their attention. 
They think that the explanation is in all probability psycho-phys- 
ical, consisting in the fact that one nervous system is more 
highly plastic than another, and that this would be the condition 
for the development of more precise and constantly functioning 
complexes of conduction, which would make possible greater 
quickness and accuracy. 

The suggestion that the explanation of differences in intelli- 
gence is psycho-physical, in itself explains nothing. Of course 
mental differences are determined by neural factors. Beyond 
this guarded general suggestion Spearman and Krueger scarcely 
seem to venture in the way of definite positive statement, except 
to state that the size of the different positive correlations is to 
be explained on the basis of the degree of connection with a 
hypothetical common central factor. 

Spearman ('04) reached conclusions that he considered very 
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important for the measurement and diagnosis of general intelli- 
gence. " On the whole then we reach the profoundly impor- 
tant conclusion that there really exists a something which we 
may provisionally term General Sensory Discrimination, and 
similarly a General Intelligence, and further that the functional 
correspondence between these two is not appreciably less than 
absolute." Also, "... the common and essential element 
in the Intelligences wholly coincides with the common and es- 
sential element in the Sensory Functions." While these state- 
ments, taken in connection with the context, seem perfectly clear 
as to their meaning, it is evident that Spearman does not now 
hold to this latter statement as originally worded.^ 

That there is practically perfect correlation between Sensory 
Discrimination and General Intelligence certainly cannot be 
ijiaintained. Dr. E. L. Thorndike ('09) investigated the rela- 
tionship between accuracy in Sensory Discrimination and Gen- 
eral Intelligence. He took two groups of subjects: (i) 37 
young women students in a normal school, (2) 25 boys in their 
third year in high school. For tests of Sensory Discrimination, 
Thorndike took 90 trials of each individual's accuracy in draw- 
ing lines to standards of 100, 75 and 50 mm., and 16 trials of 
each subject in weighting boxes to standards of 100 and 200 g. 
As a measure of General Intelligence, each girl was rated for 
general intelligence by all the rest, and also by 8 of the pro- 
fessors in the normal school. The scholastic records of the 
girls were also used. The boys were similarly rated by 6 of 
their fellows and by 4 professors. The tests were given on 
different days to eliminate common disturbing factors. The 
median deviation in age for girls was 10 months, and for the 
boys I year. 

The results are summed up in the form of Pearson coefficients 
of correlation (raw) as follows: 



" On page 165 of Cyril Burt's article cited below, footnote 3, he says : 
" With reference to my criticism of the passage cited above (p. 159) 
formulating his view of the relation of General Sensory Discrimination 
and General Intelligence, Dr. Spearman has written me: 'This conclu- 
sion of mine was badly worded. I did not mean (as others have naturally 
taken it) that general intelligence was based on sensory discrimination; 
if anything, vice versa. I take both the sensory discrimination and the 
manifestations leading a teacher to impute general intelligence to be based 
on some deeper fundamental cause, as sketched in the Zeitschrift fur 
Psychologie, Vol. XLI. p. no, par. 5.'" 
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Discrimination of lengths, 1st half with 2nd half 

Discrimination of weights, 1st half with 2nd half 

Pupils' impressions of intellect, 1st half with 2nd half 

Teachers' impressions of intellect, 1st half with 2nd half. . 

Academic record, 1st half with 2nd half 

Combined teachers' and pupils' impressions, 1st half with 
2nd half 



Thus the measures of the two phases of Sensory Discrimina- 
tion and the estimates of General Intelligence were satisfactory 
as to reliability. The inter-correlations found were: 






Women 


Boys 


Scores for all lines with those for all weights 

Scores for all lines with pupils' impressions of intellect . . 

Scores for all lines with teachers' impressions of intellect . 

Scores for all lines with teachers' and pupils' impressions 

of intellect 


52 
25 
12 

23 
08 

85 
16 


25 
0^ 


Scores for all lines with academic record 

Scores for all weights with pupils' impressions of intellect 
Scores for all weights with teachers' impressions of intellect 
Scores for all weights with teachers' and pupils' impres- 
sions of intellect 


—01 

20 


Scores for all weights with academic record 


21 


Pupils' impressions of intelligence with teachers' impres- 
sions 




Pupils' and teachers' impressions with academic record. . 
Combined weights and Unes with pupils' and teachers' 


54 


Combined weights and lines with pupils' and teachers' 


14 







Hence the coefficient, corrected by the Spearman methods, be- 
tween 1st, factor common to accuracy in lines and weights, and 
factor common to pupils' and teachers' impressions of intelli- 
gence, is 20; 2nd, factor common to accuracy in lines and 
weights, and factor common to combined teachers' and pupils' 
impressions and academic record, is 25. According to this, the 
most probable correlation between General Sensory Discrimina- 
tion and General Intelligence would be about 23. 

Burt's results on this point (cited below) are difficult to in- 
terpret on account of the somewhat conflicting results in the 
different groups. If, however, we permitted ourselves to take 
the average of Burt's correlations between different forms of 
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t 
Sensory Discrimination and Estimated Intelligence, we find it to 
be about 21. 

There still remains Spearman's provisional theory of a 
hierarchy of mental functions as the explanation of the fact of 
general intelligence. " Wherever branches of intellectual ac- 
tivity are at all dissimilar, then their correlations with one 
another appear wholly due to their being all variously saturated 
with some common fundamental Function (or group of Func- 
tions)." " . . . the remaining or specific elements of the 
activity seem in every case to be wholly different from that in 
all the others." 

According to this theory, no two mental functions could be 
more closely related to one another than each is related to the 
common central factor. For instance in our -results. Memory 
of Words and Memory of Passages should relate no more 
closely to each other, and the A test and Geometrical Forms no 
more closely to each other, than the element common to Memory 
of Words and Memory of Passages relate to the element 
common to the A test and Geometrical Forms. This can be 
very readily tested by the use of the correction formula, 

Rpiq^ + Rpiq2 + Rp2qi + Rp2q2 

Rpq = 4 , where Rpq represents 

^Rp,p, X Rq,q, 
the correlation between the common factor in p^ and p^, and the 
common factor in q^ and gj/ ^Pi^i represents the correlation be- 
tween the ability tested by, say, Memory of Words and the 
ability tested by the A test; Rp^q^ represents the correlation be- 
tween the ability tested by the Memory of Words test and the 
ability tested by the Geometrical Forms test; Rp^Qi represents 
the correlation between the abilities tested by Memory of Pas- 
sages and the A test; and Rp^q^ represents the correlation be- 
tween the abilities tested by Memory of Passages and the 
Geometrical Forms test. 

Then taking the estimated true coefficients of correlation 
from Table XIV, and substituting the values in the above equa- 

54 -F 49 -h 46 + 33 
tion we get, Rpq = 4 =54- 

V80 X 87 
which is much lower than Rpip2> 80, or Rq^q^, 87. 
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That is, Memory of Words and Memory of Passages are more 
closely related one to another, and A test and Geometrical Forms' 
are more closely related one to another than the factor common 
to the first two is to the factor common to the second two. 

Similarly facts for the abilities tested by the Ebbinghaus and 
Hard Opposites tests, and the A and Geometrical Forms tests, 
letting pip^Qx and g^ refer to these in order, are that 

54 + 36+58 + 42 
Rpq = 4 =55, which is much below 85 or 87. 

V8s X 87 

Again on the basis of Spearman's theory of a common central 
factor, if the coefficients of correlation among a number of abili- 
ties are arranged in descending order, from left to right and 
from top to bottom as are the ones he obtained on page 86 
above, in every line the figures should be in descending order as 
they are on the top line and on the vertical line on the left. 
According to our results, secured from 37 subjects instead of 
only II, this is by no means the case, as is evident from the fol- 
lowing from Table XIV, p. 63. 
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Thus it is quite evident that Spearman's theory is not in har- 
mony with the facts we have secured. 

Cyril Burt ('09) tested 43 boys between the age,s of 12 years 
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6 months and 13 years 6 months, one group of 13 boys from a 
high class College Preparatory School, and the remaining group 
of 30 from an Elementary School. The tests used can be 
grouped as follows: 

I. Sensory Discrimination. 

Touch, weights, pitch, lengths. 
II. Motor Tests. (Simple reactions). 
Tapping, card dealing. 

III. Sensory Motor Tests. 

Card sorting, alphabet finding. 

IV. Association Tests. 

Immediate memory — concrete words, abstract words, 

nonsense syllables 
Mirror test — formation of motor association by trial 

and error. 
Spot pattern test — apperception of a form composed 
of dots. 
V. Test of Voluntary Attention. 
Dotting irregular dots. 

The correlations found by Burt are as follows: 



Corrected Coefficients 

for 

Elementary School 
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Spot Pattern 

Dotting 

Mirror Tracing 

Alphabet 

Tapping 
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Dealing 

Sorting 
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Touch 
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26 

37 
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68 


64 




75 


-61 


94 


40 


71 


47 


62 


83 


50 


66 


84 


75 




40 


85 


-29 


10 


32 


-17 


37 


36 


06 


06 


-61 


40 




-07 


-51 


-40 


07 


45 


32 


57 


44 


27 


94 


85 


-07 




21 


15 


-32 


31 


-48 


09 


-17 


27 


40 


-29 


-51 


21 




61 


-33 


25 


-06 


74 


-20 


22 


71 


10 


-40 


15 


61 




-62 


63 


35 


44 


41 


27 


93 


40 


-06 


23 


26 


53 



Sorting 

Dotting 

Alphabet 

Tapping 

Imputed Intelligence . 

Memory 

Mirror 

Spot Pattern 

Dealing 

Lines 

Touch 

Weight 

Soimd 



-62 
63 
35 
44 
41 
27 
93 
40 

-06 
23 
26 
53 



The above are arranged approximately in descending order, 
but it is very evident that they even more strongly contradict 
Spearman's original theory of a hierarchial arrangement than 
do our results spoken of above. Apparently, however, Spear- 
man himself has abandoned or modified this view. In his 
demonstration of a " Hierarchy of Coefficients," Burt uses the 
raw coefficients from the amalgamated measurements, rather 
than the corrected coefficients which we have quoted. In this 
connection he says : " Dr. Spearman and Prof. Krueger imply 
that satisfactory hierarchies are exhibited only by the ' pure ' or 
theoretical coefficients ; but it appears that those based on amal- 
gamated measurements are better than those based on theoreti- 
cal ' correction,' if the experimental conditions are carefully con- 
trolled." ..." The theoretical values for the ideal hier- 
archy may be obtained by various mathematical formulae." 
" The following simple formula has been supplied for this pur- 
pose by Dr. Spearman (to whom I am here particularly in- 
debted for several improvements on my own demonstration of a 
hierarchy). . . ." But when the raw coefficients fail to ac- 
cord with the theory, and when the corrected coefficients fail to 
accord with the theory, and when coefficients from amalgamated 
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series of measurements fail to accord with the theory, why not 
abandon the theory! Surely this would be better than to re- 
sort to further manipulation of data influenced to some extent 
by variables at present uncertain, and consequently to that extent 
untrustworthy. Only when the variables themselves are fully 
and definitely understood can such methods be used with any 
degree of safety. 

Burt does not give the correlations between memory of con- 
crete words, memory of abstract words, and memory of non- 
sense syllables. In comparing his results with ours we shall 
take his data for the boys of the Elementary School, both be- 
cause the greater number of subjects in this group makes the 
results more reliable than in the case of the other group, and 
because the group itself is probably more representative of 
average boys than are boys in a College Preparatory School. 
Perhaps the fairest comparison to make would be between his 
coefficients for the Elementary School Group, and our estimated 
true coefficients, though one would expect the latter to be some- 
what higher. 

Burt's corrected coefficient for Memory and the Spot Pat- 
tern test is 41 ; between Memory of Words and Learning Pairs, 
we get for the Poor group 44, and for the estimated true co- 
efficient 65. The agreement here is as close as could be ex- 
pected. 

As to correlation between Sensory Discrimination and 
Memory, Burt gets a correlation between Discrimination of 
Lines and Memory of 05; between Drawing Lengths and 
Memory of Words we get -09 for the Poor group, and -05 as 
the estimated true correlation. 

Bonser ('10) gave tests in reasoning and selective thinking 
to 385 boys and 372 girls of the Fourth, Fifth and Sixth school 
grades of public schools in Passaic, New Jersey. " The tests 
employed were made up of a series of problems and questions 
designed to exercise the most fundamental four phases of 
reasoning activity, namely: The mathematical judgment; con- 
trolled association; selective judgment; and that complex of 
analytic and synthetic thinking used in the intellectual interpre- 
tation of literature. 
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Mathematical Judgment 

" The problems for testing the mathematical judgment were of 
two kinds, two sets of five each, I, A and B, stated in the form 
usually followed in current text-books in arithmetic; and two 
sets of five each, II, A and B, of the same difficulty as the pre- 
ceding in processes involved but stated in a less conventional 
way. Each of the ten problems of the first type may be called 
a " two-step " problem — it requires a preliminary operation for 
securing the intermediate datum necessary before the final 
operation can be accomplished. 

Tests I and II 
" LA. Get the answers to these problems as quickly as you 



can. 

I 
2 



If J4 of a gallon of oil costs 9 cents, what will 7 gallons cost? 
John sold 4 sheep for $5 each. He kept }i of the money and with 

the other ^ he bought lambs at $2 each. How many did he buy? 
A pint of water weighs a pound. What does a gallon weigh? 
At 1254 cents each, how much more will six tablets cost than 10 

pens at 5 cents each? 
At 15 cents a yard, how much will 7 feet of cloth cost? 



B. 

I. 



A man whose salary was $20 a week spends $14 a week. In how 

many weeks can he save $300? 
How many pencils can you buy for 50 cents at the rate of 2 for S 

cents ? 
A man bought land for $100. He sold it for $120, gaining $5 an 

acre. How many acres were there? 
A man spent % of his money and had $8 left. How much had he at 

first? 
The uniforms for a baseball nine cost $2.50 each. The shoes cost $2 

a pair. What was the total cost of uniforms and shoes for the 

nine? 

II. A. 

I. 132 plus what number equals 36? 

2 If John had 15 cents more than he spent today he would have 40 
cents. How much did he spend today? 

3. What number minus 7 equals 23? 

4. If James had 4 times as much money as George, he would have $16. 

How much money has George? 

5. What number added to 16 gives a number 4 less than 27? 
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B. 

1. What number subtracted 12 times from 30 will leave a remainder 

of 6? 

2. If a train travels half a mile a minute, what is the rate per hour ? 

3. What number minus 16 equals 20? 

4. What number doubled equals 2 times 3 ? 

5. If 7 multiplied by some number equals 63, what is the number? 

" In the original blanks, immediately following each problem 
space was left for its solution. 

Controlled Association 

" For controlled association, three types of tests were used. 
First, two sets of ten sentences each. III, A, a and b, were given 
with a significant word omitted from each to be filled in by the 
pupil. Second, two sets of ten sentences each. III, B, a and b, 
were given in each of which two significant words were placed, 
one above the other, one giving a correct meaning to the sen- 
tence, the other an erroneous meaning, the pupil to draw a line 
through the wrong word leaving the sentence so that it would 
read correctly. Third, three sets of twenty words each, IV, A, 
B and C, were given to pupils, they to write beside each respec- 
tive word a word just its opposite in meaning — the familiar 
" opposites " test. 

Tests III and IV 

" III. A. a. Complete the following sentences as quickly as 
you can by filling the blank spaces with appropriate words : 
I always comes in the last week in December. 

2. A is one who plays a musical instrument. 

3. The city is in Russia. 

4 are large, visible bodies of watery vapor floating about 

in the air. 

S used for building houses are made of clay. 

6. The machine on a railroad for drawing cars is an 

7 is the most useful metal for blacksmiths. 

8 live and swim about in the water. 

g. Most light summer clothing is made of goods. 

10 is a holiday. 

III. A. b. 

1. The flesh of cattle used for food is called 

2. The months are June, July and August. 

3. The makes it light during the day. 
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4 catch many mice and birds. 

5- A is a large stream of water flowing through the land. 

6. Men who live in the country and till the soil are called 

7 is a mineral which we burn. 

8. The Ocean is east of the United States. 

9 sell sugar, vegetables and other foods. 

10. There are hours in half a day. 

III. B. a. As quickly as you can, make these sentences cor- 
rect by drawing a line through the wrong word where two words 
occur, one above the other : 

longer 

1. Days are in summer than in winter. 

shorter 

up 

2. Water always flows hill. 

down 
more 

3. Glass breaks easily than tin. 

less 
earlier 

4. The sun rises in January than in July. 

later 
softer 

5. Iron is than wood. 

harder 
warmer 

6. It is in Florida than in Maine. 

colder 

heavier 

7. Anjrthing that floats is than water. 

lighter 
more 

8. Oranges grow satisfactorily in California than in New Jersey. 

less 
shorter 

9. Shadows are in summer than in winter. 

longer 
more 
10. Plants grow readily in warm sunshine than in the cool shade, 

less 

III. B. b. 

stronger 

1. Men are usually than women. 

weaker 

less 

2. A pound of iron is worth than a pound of copper. 

more 
before 

3. Christmas comes Thanksgiving day. 

after 
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IV. 



Cotton ( 


:lothing is 


than wool, 
cooler 




Less 








coal is used 


in summer than in winter. 




More 


poorer 






Bankers 


are 

richer 


than cab drivers. 




More 








1 


dorses than mules are used for driving 


; purposes. 


Fewer 


more 






There are teachers than preachers, 
feiver 






more 






Oranges 


; are 

less 


sweet than lemons. 




More 








bread than cake is eaten in this city. 




Less 








. As quickly as 


you can write beside each of thes 


rd that 


means exactly its opposite. 




A. 




B. 


C. 


day- 




great 


bad 


asleep 




hot 


inside 


absent 




dirty 


slow 


brother 




heavy 


short 


best 




late 


little 


above 




first 


soft 


big 




left 


black 


backwards 


morning 


dark 


buy 




much 


sad 


come 




near 


true 


cheap 


; 


north 


dislike 


broad 




open 


poor 


dead 




round 


well 


land 




sharp 


sorry 


country 




east 


thick 


tall 


t 


known 


full 


son 




something 


peace 


here 




stay 


few 


less 




push 


below 


mine 




nowhere 


enemy 



Selective Judgment 

" Two types of tests were used for selective judgment. First, 

two set^, y, A and B, of two series each of ten reasons why 

some given fact is true, some of which reasons are correct, the 

others incorrect or irrelevant, were given. The pupil was to 
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select, by checking, the correct reasons. Second, there were 
given similarly two sets, IV, A and B, of three series each, of 
five definitions for a given thing or term, some of which were 
correct, the others incorrect or irrelevant: 

Tests V and VI 

" V. A. The following reasons have been given to show why 
New York has become a larger city than Boston. As quickly 
as you can, place a cross like this, +, before each reason you 
think a good one: 



New York is on an island. 



More foreigners live in New York than in Boston. 
New York is on a large river coming from a rich agricultural region. 
Mr. Rockefeller has a fine home in New York. 
New York has more churches than Boston. 

New York has better communication with the States lying to the 
west. 

7. New York has elevated railroads. 

8. New York is in the midst of a rich fruit and agricultural district. 

9. New York is nine or ten years older than Boston. 
10. New York has a republican governor. 

B. These reasons have been given to show that oak is better 
than pine for making furniture. Check the good reasons. 

I. Oak wood is harder than pine. 

3. Oak trees have acorns, pine trees do not. 

3. Oak wood takes a finer polish than pine. 

4. Oak trees have more beautiful leaves. 

5. Oak trees make good homes for squirrels. 

6. Pine wood will not last so long as oak. 

7. Pine is more easily dented and defaced than oak. 

8. When polished and varnished, oak is much more beautiful than pine. 

9. Pine trees are sometimes used for Christmas trees. 
ID. Oak trees are easier to climb than pine trees. 

C. The following reasons have been given to show why 
oranges grow better in Florida than in New Jersey. Check the 
good reasons. 

I. There are many negroes in Florida who work very cheaply. 



Florida has warm summer weather almost the whole year. 

There are no alligators in New Jersey. 

Florida very rarely has hard frosts. 

New Jersey is not so large as Florida. 

Florida was settled earlier than New Jersey. 

New Jersey grows many fine peaches. 

Florida has a very moist, warm climate. 
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9. Florida is a word meaning the land of flowers. 
10. Florida is a popular winter resort. 

D. Among these reasons why horses are better than cattle 
for driving and working animals, check those which you think 
are good reasons. 

1. Horses are more intelligent than cattle. 

2. Cattle are not so tall as horses. 

3. Horses like corn, oats and hay. 

4. Horses are much more active and walk faster than cattle. 

5. Cattle are extensively used for food. 

6. Horses are much more beautiful and graceful than cattle. 

7. The skins of horses are sometimes made into gloves. 

8. Horses are more easily trained and controlled than cattle. 

9. President Roosevelt likes to ride on horseback. 

10. Horses have more rapid and varied gaits than cattle. 

VI. A. In the following definitions, place a small cross, like 
this +, before those which you think are good ones, doing it as 
quickly as you can. 

a. Definitions of a shoe. 

1. A portion of clothing. 

2. Something black made of leather. 

3. A protective covering for the feet, usually made of leather, 

having a firm bottom or sole and flexible upper portions, an 
opening for the foot being fastened by lacings, buttons or 
buckles. 

4. Something to wear on the feet. 

5. A necessary article costing from one to five or six dollars. 

b. Definitions of an island. 

1. A piece of land out in the water. 

2. A small body of land. 

3. A body of land entirely surrounded by water. 

4. Cuba is an island. 

5. A portion of land rising above the surrounding level. 

c. Definitions of to explode. 

1. To burst suddenly with a loud noise. 

2. To knock all to pieces. 

3. To make a very loud noise. 

4. To fill the air with a tumultuous roar. 

5. To blow up. 

a. Definitions of a chair. 

1. A piece of household furniture. 

2. A movable seat with a back intended for one person. 

3. A piece of furniture on which to sit. 

4. Rocking chairs are comfortable chairs. 

5. A single seat having a back. 
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b. Definitions of to write. 

1. To make marks with a pen or pencil. 

2. To make characters which stand for ideas. 

3. To use a pen or pencil. 

4. To make marks on any kind of surface with any kind of an 

instrument which will express one's ideas so that another 
may understand them. 

5. To write a letter. 

c. Definitions of a buggy. 

1. A buggy is black. 

2. A buggy is something to ride in. 

3. A buggy is a light, four wheeled vehicle, with or without a 

top or covering, designed for carrying two or three persons. 

4. A buggy is drawn by horses. 

5. A buggy may have rubber tires. 

Literary Interpretation 

" For literary interpretation, two stanzas of poetry, VII, A and 
B, were used, the pupil to write the meaning of each in his own 
words. These poems are taken from a third reader and a 
second reader respectively, each from a different standard series 
published within a decade of the time of these tests. 

Test VII 

" VII. A. Read carefully the following stanza, then write its 
meaning in your own words. 

' This little rill, that from the springs 
Of yonder grove its current brings. 
Plays on the slope awhile, and then 
Goes prattling into groves again. 
Oft to its warbling waters drew 
My little feet, when life was new.' 

B. Read carefully the following stanza, then write its mean- 
ing in your own words : 

' Under the greenwood tree, 
Who loves to lie with me. 
And tune his merry note 
Unto the sweet bird's throat. 
Come hither, come hither, come hither, 
Here shall he see 
No enemy 
But winter and rough weather.' 
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Spelling 

" As an incidental problem for correlation, the opportunity 
offered for a test in spelling was taken. Two papers, B and C, 
from test V, the opposites test, were graded in spelling for each 
pupil. As the pupils did not know that the papers were to be 
graded in spelling, it had little of the disadvantages of the 
formal spelling test, yet the words were practically predeter- 
mined and uniform." 

Bonser's results are given in part in the following table: 

TABLE XX 

Averages op Coefficients of Cokrblation by Grade, by Age, 
AND for Both Grade and Age 



Boys 


, 385 


Girls 


,372 


By 


By 


By 


By 


Grade 


Age 


Grade 


Age 


39 


58 


32 


34 


33 


49 


31 


53 


21 


44 


17 


52 


10 


38 


31 


25 


18 


48 


08 


32 


38 


69 


58 


70 


16 


34 


20 


25 


55 


57 


54 


47 


38 


45 


35 


35 


25 


31 


21 


16 


40 


48 


43 


39 


62 


61 


58 


40 


27 


35 


21 


07 


41 


65 


34 


56 


35 


45 


30 


25 


26 


37 


28 


35 


87 


87 


81 


85 


04 


29 


25 


28 


45 


54 


36 


53 


22 


40 


09 


34 


70 


79 


62 


80 


07 


18 


06 


29 


23 


28 


11 


32 


60 


63 


59 


50 


03 


18 


04 


13 


32 


49 


27 


39 


27 


25 


17 


30 


02 


30 


23 


32 



Total, 757 



By Grade 
and Age 



I-II and III 

" " IV 

" " V 

" " VI 

" " VII 

" " Total ... 

" " Spelling. 

III and IV 

" " V 

" "VI 

" " VII 

" " Total ... 

" " Spelling. 

IV and V 

" " VI 

" " VII 

" " Total ... 

" " Spelling. 

V and VI 

" " VII 

" " Total... 

" " Spelling. 

VI and VII 

" " Total . . . 

" " Spelling. 

VII and Total ... 

" " Spelling. 

Total and Spelling 



41 
42 
33 
26 
26 
59 
24 

53 
38 
24 
45 
55 
22 

49 
34 
32 

85 
21 

47 
26 
73 
12 

24 
58 
09 

37 
25 

22 
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Bonser concludes that the tests " are valid measures of sev- 
eral phases of that complex capacity we call reasoning ability. 
Group correlations are so much higher among these tests than 
among those so far produced among mental abilities more varied 
in kind that we are clearly justified in holding them to be tests 
of abilities which are varieties of one general species of 
ability." 

Bonser's coefficients of correlation are all obtained by the 
method of like and unlike signs, and are uncorrected for at- 
tenuation. " The highest coefficients as shown by the averages, 
are, in their order, that for tests III and IV, the two forms of 
controlled association, — completing sentences and opposites — 
53 ; that for tests IV and V, the opposites and the selection of 
reasons in one of the tests in selective judgment, 49; that for 
V and VI, the two tests in selective thinking, 47; and that for 
III and VII, controlled association and interpretation of poems, 
45. By correction these would all be raised to above 75, those 
for III and each of the others approaching 100 very closely." 
This last statement is made on the ground of a few coefficients 
of correlation that have been corrected by two different methods, 
and it must be remembered of course that this is only a rough 
estimate. However, the general fact that the correlations of the 
different tests of selective thinking with one another are rela- 
tively very high, is in accord with our results as shown in Tables 
IX, X, and XIV. 

Bonser further concludes that " the results here derived point 
to the conclusion that the correlations among the abilities here 
tested are a matter of native capacity rather than the result of 
training." This follows mainly from the facts that, (i) the cor- 
relations on the basis of age are considerably higher than those 
on the basis of grades ; (2) from the fact that the median age of 
the best 10 per cent and of the poorest 10 per cent is nearly the 
same, while in the scores obtained in the tests, they differ from 
three hundred to three thousand per cent. Moreover the chil- 
dren had had almost no training in the exact type of problems 
as set, in the opposites test, and in the form of selective judg- 
ment of test V, yet these stand, in this order, in the highest cor- 
relation to the total ability shown for all of the tests — all of 
which suggests that these abilities have not been developed as 
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by-products of school training, but that the results of the tests 
are measures of the native capacity of the children for the ac- 
tivities required in these problems. 

There is however a disturbing factor entering into Bonser's 
results which is worthy of consideration. It arises from the 
fact that in all tests except that of Opposites, the time element 
was not kept constant for all. All of the pupils were stopped at 
the moment when the first pupil of the room had just finished. 
Bonser does not state the time record in the different rooms, 
nor take it into consideration in scoring the results. Presum- 
ably the pupils in the higher grades would thus get less time 
for the work than those in the lower grades, in so far as the 
time taken by the quickest pupil in an upper grade would be 
less than the time taken by the quickest pupil in a lower grade. 
It would thus appear that Bonser's results underemphasize the 
superiority of the upper grades over the lower ones. It would 
tend to favor the younger pupils. While it would not affect the 
size of the correlations by grades, it would probably tend to 
raise somewhat, — and certainly to disturb more or less, — the 
correlations by age. How much this disturbance would amount 
to is difficult to say. It would probably be slight. Even grant- 
ing that it would be considerable, however, Bonser's contention 
that the tests measure native capacity rather than training, 
would still have sufficient to support it in the other arguments 
used, apart from that of the higher correlations by age than by 
grades. This conclusion also is in accord with our results. 

William Brown ('ii) records the results of a somewhat ex- 
tensive investigation " for the purpose of determining to what 
extent correlation exists between certain very simple mental 
abilities in cases where the individuals experimented upon are, 
as near as may be, identically situated with respect to previous 
practice, general training, and environment; and how closely, if 
at all, these elementary abilities are related to general intellec- 
tual ability as measured by teachers' judgments, school marks, 
etc. Every effort was made to keep the groups of individuals 
tested as homogeneous as possible; and instead of measuring 
irrelevant factors and ' correcting ' for them in the later stages 
of the research, the influence of such irrelevant factors was ex- 
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eluded right from the beginning by a rigorous segregation of 
the material, and in other ways. 

" The groups of individuals to which the tests were applied, 
were as follows : 

Group I, 66 boys of a London elementary school, all between 
the ages of 11 and 12. 

Group II, 39 girls of a London elementary school, all between 
the ages of 11 and 12. 

Group III, 40 boys of a London higher grade school, all be- 
tween the ages of 11 and 12. 

Group IV, 56 training college students (women), of the same 
year and of approximately the same age. 

Group Va, 35 university students (men). 

Group Vb, 23 university students (women)." 

The tests employed were selected " not so much for their 
a priori likelihood of showing inter-correlation, as for their con- 
venience in admitting of application to an entire group of sub- 
jects simultaneously and unobtrusively. The following is a list 
of them: 

1. Crossing through letters e and r in a page of print. 

2. Crossing through letters a, n, o, and s in a page of print. 

3. Crossing through every letter in a page of print. 

4. Adding up single digits in groups of ten. Measurement 
of (a) speed, (b) accuracy. 

5. Bisecting ten printed lines (80 mm. long), and putting in 
one of the points of trisection in each of the ten other lines 
(90 mm. long). 

6. Muller-Lyer Illusion. Measurement of (a) size, (b) 
mean variation. 

7. Vertical-Horizontal Illusion. Measurement of (a) size, 
(b) mean variation. 

8. Mechanical Memory (permanent), tested by means of 
nonsense syllables. 

9. Memory for poetry. 

10. Combination test (Ebbinghaus mutilated test). 

In the case of groups I and II, recourse was also had to : 

11. Marks for Drawing. 

12. Total school marks. 
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13. Grading for General Intelligence (two independent 
measures). 

Finally with Groups Va, and Vb, the following test was also 
employed : 

14. Association-time (uncontrolled). Measurement of rate 
of sequence of ideas called up by a stimulus-word." 

The author does not seem to appreciate fully the inaccuracies 
which may creep in and influence the results, when the tests are 
given as group tests rather than to each individual separately. 

With the exception of test (9), and in some cases test (8), 
every test was applied twice, the second test being given about 
a fortnight after the first, and at the same hour of the day. 

It is highly probable that the gain in accuracy owing to the 
comparatively large number of subjects in the different groups 
is more than counterbalanced by the tendency to spurious cor- 
relation due to the tests being given as group tests, and by the 
inaccuracies due to the small number of measurements taken. 
However there is still need of considerable diversity of general 
method in investigations in correlation, to insure adequate cor- 
roboration and verification of results. 

Brown's numerical results are summarized in part in the fol- 
lowing tables : 
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Brown's Results. Pearson Coefficients of Correlation 
(Top line, Group I; 2nd line, Group II; 3rd line. Group III) 
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00 


14 
00 
00 


10 


59 
40 


55 
49 


Memory of Poetry.. 


27 
23 


28 
14 


52 
44 


49 
38 




41 
00 


38 
-11 


12 
19 


13 


60 


57 


Addition (Speed).. . . 


59 
13 
35 


51 
00 
20 


40 

-13 

32 


27 

-13 

00 


41 
00 




13 
24 
33 


25 
33 
20 


00 


00 
28 


10 

24 


Addition (Accuracy) 


30 
00 
00 


24 

00 

-11 


38 

-25 

00 


31 

-23 

00 


38 
-11 


13 
24 
33 




00 
30 
00 


41 


00 
11 


00 
00 


Motor (all letters)... 


53 
49 
25 


21 
21 
GO 


13 
00 

28 


14 
00 
00 


12 
19 


25 
33 
20 


00 
30 
00 




00 


00 
23 


13 
32 


Bisection 


00 


00 


15 


10 


13 


00 


41 


00 












School Marks 


00 
30 


27 
17 


54 
60 


59 
40 


60 


00 

28 


00 
11 


00 
23 






64 
78 


General Intelligence. 


00 

28 


13 
10 


43 
69 


55 
49 


57 


10 
24 


00 
00 


13 
32 


64 

78 
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Brown's Results. Pearson Coefficients op Correlation 
(Top line, Group IV; 2nd line, Group Va) 







^^ 




X 










C3 


^ 


o 








1 




1 
a. 


1 




a 




i 


< 

i 


1 


■3 
1 


.S 


g 
1 




Jl 


T3 


-« 


"o 


s5 






J3 


13 


T3 


05 




^ 




H 


< 


<J 


S 


S 






53 


34 


31 










-16 


19 




19 


33 




53 




43 


20 








-16 




38 




-26 


39 


Addition (speed) 


34 


43 




18 








19 


38 






00 


37 


Mechanical Memory 


31 


20 


18 












Marking er 
















19 


- 26 


00 






18 


Association time 


33 


39 


37 




-18 









The principal conclusions summed up by Brown are : " The 
correlation between different psychical abilities is not very close. 
Few correlations are greater than .60. 

" The size of the correlation coefficient varies greatly from 
one group of subjects to another. This shows how great is the 
danger of spurious correlation, due to heterogeneity of material, 
in psychical measurements. 

" The Combinations-Method of Ebbinghaus is a good measure 
of intellectual ability. It correlates with general intelligence 
almost as closely as scholastic intelligence (school marks) does. 
Mechanical memory correlates fairly closely with intelligence. 
. Correlations may be very low even within a set of 
mental tests which appear to measure closely related mental 
abilities, and this when the reliability coefficients are high. Thus 
the correlation between erasing the letters a, n, o, s, and erasing 
all the letters, is less than three times the probable error in every 
group tested. 

" In homogeneous groups of subjects there is no positive evi- 
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dence of the existence of one ' central factor ' to which the cor- 
relations between the individual mental abilities may be regarded 
as due." 

Conclusions 
A — As to method: 

In general we have found it highly instructive to calculate 
correlations with two contrasting, selected groups, and also with 
the groups combined. The correlations obtained in one group 
can then be used to check up correlations obtained in the other 
groups. In this way, too, we get a far more accurate idea of 
the true amount of the correlations among abilities, for people 
in general. It also shows that cases that have been previously 
interpreted as giving negative correlations, would in all proba- 
bility have given small positive correlations if the group chosen 
were representative not of a selected group, but of people of 
all degrees of mental ability chosen at random. In fact, there 
appear to be few if any negative correlations. 

Much labor has been spent upon the correction of Pearson 
coefficients of correlation by means of the Spearman formula. 
The gain in accuracy thus secured does not seem proportionate 
to the amount of time spent. It would be productive of greater 
results to spend more time in getting as accurate and reliable 
records as possible of each individual's ability in each test, so 
that correction on an extensive scale would not be necessary, 
since in any case correction is valid only when the reliability of 
the records themselves is fairly high. 

B— /ij to facts: 

The most important results are probably the quantitative ones 
presented in Tables IX, X, and XIV, summarizing the correla- 
tion of each test with each of the other thirteen tests used. 

We find justification for the common assumption that there 
is close inter-relation among certain mental abilities, and conse- 
quently a something that may be called ' general mental ability ' 
or ' general intelligence ' ; and that on the other hand certain 
capacities are relatively specialized, and do not necessarily imply 
other abilities except to a very limited extent. 

Of the six varieties of capacity tested, in so far as the tests 
used are representative of them, we find that those most inti- 
mately related to other abilities are (a) selective thinking, (b) 
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memory and association, (c) quickness and accuracy of percep- 
tion, (d) motor control, (e) sensory discrimination, each in 
the order named. 

This in turn throws light upon the qu^tion as to what con- 
stitutes 'general intelligence.' This is a broad term, and may 
be subject to some variation in interpretation. Our Good group 
may not be exactly representative of general intelligence as ex- 
hibited in occupations and callings of a different order from that 
required in leadership in the teaching profession or in an educa- 
tional career. We must also acknowledge limitations in the num- 
ber and variety of tests used. However, subject to these limita- 
tions, we find that ' general intelligence ' implies the different 
abilities tested in the relative order stated in the above para- 
graph — abstract thinking in very high degree, memory and as- 
sociation in less degree, etc. 

We find no justification for the view that 'general intelli- 
gence ' is to be explained on the basis of a hierarchy of mental 
functions, the amount of correlation in each case being due to 
the degree of connection with a common central factor. 

Finally we find that ' general intelligence,' as commonly under- 
stood, can be measured with a high degree of accuracy by the 
use of certain of the tests. In fact, an hour so spent in 
testing an individual gives us a very significant indication of 
his ' general intelligence ' as the term is commonly understood 
and used by well educated people. The time seems not far dis- 
tant when we shall be able to say to a student : " Such and such 
is the order of general mental capacity that we may expect of 
you at the present time. If you do not attain to such and such 
a standard of efficiency, it will be due to other causes than lack 
of mental capacity." 

It would not be difficult to improve at least five of the tests 
used, viz., Hard Opposites, Ebbinghaus test. Easy Opposites, 
Learning Pairs and Recognizing Forms, to administer them in 
such a way as to make them satisfactory in reliability, and to 
secure norms of performance in each of them. These norms 
could then serve as a basis for comparison, enabling us to se- 
cure important information as to significant phases of the gen- 
eral mental capacity of any individual, which would be of great 
practical value to a prospective employer, or to those directing 
the educational career of the person in question. 
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APPENDIX 

The tests used, the names used in this monograph to designate 
them, and the directions given, in brief, in administering them 
were as follows : 

Test I. "A Test" 
I, a. As quickly as you can, mark all the A's. 

I, b. " " " " B's. 
GWBTBVKIKSCSAUEBCIWVABZSMDUBKLWHKHYCGYGK 
NANNCBVBSAKOIUPEKCXVGSTVRIWYBYGKHAZLPBYO 

and IS54 lines more of the same sort. 

Test 11. "Geometrical Forms" Test. (See Fig. i, A.) 

II, a. As quickly a?, you can, mark all the hexagons with the point up, 
thus : ij 

II, b. As quickly as you can, mark all the semicircles with the flat side 
up, thus: O 

Test III. "Scroll Test." (See Fig. i, B.) 

III, a. With the fountain pen given, trace the space between the black 
lines as quickly as possible without touching the black. 

Ill, b. Ditto, going faster or slower than in a, as directed. 
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Fig. I 
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Test IV. "Easy Opposites" Test. 

IV, a. As quickly as possible give orally a word that means the exact 
opposite of each word in the list. 

IV, b. Ditto with second list. 

IV, c. " " third " . 

IV. d. " " fourth " . 



IV, a 


IV, t 


IV. c 


IV, d 


good 


stale 


high 


day 


outside 


hot 


up 


asleep 


quick 


dirty 


wet 


absent 


tall 


heavy 


new 


brothel 


big 


late 


soft 


best 


loud 


first 


wider 


over 


white 


left 


wrong 


big 


light 


morning 


yes 


backwards 


happy 


much 


young 


buv 


false 


near 


brave 


come 


like 


north 


winter 


cheap 


rich 


open 


weak 


broad 


sick 


in 


forget 


dead 


glad 


sharp 


wila 


land 


thin 


east 


beginning 


country 


empty 


sour 


straight 


tall 


war 


something 


raise 


son 


many 


stay 


rough 


here 


above 


push 


love 


less 


friend 


nowhere 


noisy 


easy 



Test V. "Recognizing Forms" Test. (See Fig. I, C, D, E and F.) 

V, a, I (C of Fig. i). You may study this for i minute; then I shall 
tell you to stop. (The general nature of the test and what was to be done 
by the subject were explained before starting.) 

V, a, 2 (D of Fig. i). Mark all the forms exactly the same as those 
seen in V, a, l. (No time limit was required in the marking.) 

V, b, I (E of Fig. i). You may study this for one and a half minutes; 
then I shall tell you to stop. 

V, b, 2 (F of Fig. i). Mark all the forms exactly the same as those 
seen in V, b, i. (No time limit was required in the marking.) 
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Test VI. "Memory of Words" Test. 

VI, a. Write down all the words in the list that you can remember 
after hearing them read once., 

VI, b. Ditto with second list, VI, b, etc. 



VI, a 


VI. b 


VI, c 


VI, d 


picture 


knife 


mouse 


whisper 


silly 


window 


bank 


Columbus 


unless 


peacock 


disease 


necessary 


lizard 


brass 


cheap 


laugh 


book 


weary 


country 


dictionary 


pain 


rich 


study 


cane 


island 


vine 


tooth 


key 


tin 


servant 


musician 


doctor 


literature 


pinch 


pie 


boat 


axe 


wheel 


building 


enough 


run 


hammock 


fruit 


walking 


tomato 


horn 


weapon 


rent 


tired 


pitiless 


spider 


earth 


frost 


crack 


mountain 


canvas 


wide 


beef 


shallow 


carpet 


Indian 


glue 


window 


steam 



Test VII. "Learning Pairs." (See Fig. 2.) 

Study VII, a, i, for one minute so that when VII, a, 2, is given, you 
can write down the corresponding word. Similarly with VII, b, i and b, 2. 
Similarly with VII, c and d (except that one and a half minutes were 
allowed for study instead of one minute). For Fig. 2, the lists of pairs 
are given in order. 

Test VIII. "Memory of Passages." 

Write down all that you can remember of the substance of the passage 
after hearing it read once. 



VIII, a, Memory of Passages 
It isn't necessary to read a book in order to be happy with it. On a 
steamer or in a hammock you simply have to have the book in your lap 
or close at hand, with the paper-cutter and pencil. It must be the sort of 
book you like. You open it and read the table of contents. A deep peace 
fills your soul. Here is this delicious book and the whole day, both yours. 
You lean back to think of books by these men and by others that you 
already know and love. Memory brings you one beautiful picture after 
another. 

VIII, b. Memory of Passages 
Thirty-two passengers were injured, none of them seriously, by the 
derailment of the Chattanooga and Washington Limited train on the 
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Fig. 2 
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Southern Railway, thirty miles south of Charlottesville, and just north 
of Ryan's Siding, Virginia, early to-day. A broken rail was the cause of 
the accident. 

The entire train composed of a baggage car, day coach and three sleep- 
ers, left the track, the sleepers being almost destroyed by fire. A special 
train was quickly made up and proceeded to this city with all the passen- 
gers of the Limited. The wreck blocked the track for several hours, all 
trains meanwhile being detained. 

VIII, c. Memory of Passages 
Langford of the Three Bars, as the title suggests, is a story of the West 
depicting cowboy life. The scenes are in South Dakota of the time of the 
"rustlers," who cared for neither the interference of man nor law. The 
action turns round the Three Bars Ranch, which is run by Paul Langford, 
" a man — a godlike type with his sunny hair and his great strength," whose 
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object it is to do away with the cattle thieves headed by Jesse Black. He 
is aided by Gorden, the county attorney, and Jim Munson, a real cowboy. 

VIII, d. Memory of Passages 
One morning a couple of Springs ago, if any of your readers had chanced 
this way, they might have seen me coming from the vineyard with two 
bluebirds, one in each hand. The birds were well and vigorous and en- 
tirely unharmed. If questioned I might have explained that I went down 
into the vineyard and picked the birds up off the ground, where they had 
the full possession of their wings, and that there are times when it is not 
difficult for me to do such things. These birds were of the species known 
as the Least-flycatcher, or Chebeck Bird. 

Test IX. "Drawing Lengths." 

I. To the right of i, draw in succession three lines each equal in length 
to I. You may add to, or take away from, the line you have drawn, as 
much as necessary in order to get it the required length. 

Similarly with lines 2 and 3. (Lines I, 2 and 3 were 100, 75 and 50 
millimeters long respectively.) 

Test X. "Estimating Lengths." 

Which line is the longer? (Two lines drawn end to end horizontally 
on a long piece of white card-board were shown.) In case of the first 
8 lines thus shown on the card-board, one line was ro8 mm. in length, 
and the other 100 mm. In case of the second set of 8 pairs of lines, one 
line was 106 mm. and the other 100. In the third set, one was 104 and the 
other 100 ; and in the fourth set one was 102 and the other 100 mm. The 
whole test was then repeated. 

Test XL "Adding." 

Add the ten sums in XI, a, as quickly and accurately as you can, writing 
down the results as you get them. 

Similarly with XI, b. 

XIa. Addition 

17 26 27 72 23 

42 51 24 13 47 

38 47 83 39 86 

91 82 19 81 64 

54 63 45 26 36 



17 


42 


38 


91 


36 


26 


51 


47 


82 


26 


27 


24 


83 


19 


45 


72 


14 


39 


62 


63 


23 


47 


86 


54 


54 
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Xlb. Addition 






41 


53 * 


67 


78 


86 


52 


67 


86 


37 


32 


86 


34 


23 


96 


44 


23 


78 


45 


72 


36 


35 


19 


67 


23 


68 


45 


52 


19 


45 


23 


13 


86 


78 


67 


92 


68 


23 


67 


78 


36 


77 


35 


23 


37 


68 


86 


67 


86 


96 


39 



Test XII. "Hard Opposites." 

Write as quickly as you can beside each word in the column a word 
that means the exact opposite of it. Do the best you can with each word 
rather than leave the space blank. 



XII, b 
serious 
grand 
clumsy 
to win 
to respect 
frequently 
to lack 
apart 
stormy 
motion 
forcible 
to float 
straight 
to hold 
after 
unless 
rough 
to bless 
to take 
exciting 



Hard 
XII, a 
vertical 
ignorant 
rude 
simple 
deceitful 
stingy 
permanent 
over 

to degrade 
weary 
to spend 
to reveal 
genuine 
level 
broken 
wild 
part 
past 
permit 
precise 



Opposites 

XII, d 
succeed 
strict 
tardy 
sleepy 
suspicious 
rigid 
suave 
sinful 

conservative 
refined 
pride 

despondent 
imaginary 
beautiful 
injurious 
diligent 
sell 
sure 
active 
venturesome 



XII, c 
tender 
animated 
proficient 
impoverish 
cruel 
generous 
haughty 
silly 

insignificant 
disastrous 
miser 
result 
hindrance 
strength 
innocent 
busy 

remember 
increase 
preserve 
belief 



Test XIII. "Completing Words," or " Ba-test." 

As quickly as possible, add any letter or letters to each syllable in the 
list, so as to make it a complete word. 



Appendix' ug 

Completing Words 



a. 




b. 




c. 




d. 


ba 


be 


bi 


bo 


ab 


ea 


ic 


ao 


ca 


ce 


ci 


CO 


ac 


eb 


id 


ob 


da 


de 


di 


do 


ad 


ec 


ig 


oc 


fa 


fe 


fi 


fo 


af 


ed 


il 


od 


ga 


ge 


gi 


go 


ag 


ef 


im 


of 


ha 


he 


hi 


ho 


al 


ei 


in 


ol 


ja 


je 


ji 


jo 


am 


el 


ir 


om 


la 


ke 


ki 


lo 


an 


em 


is 


on 


ma 


le 


li 


mo 


ap 


en 


it 


op 


na 


me 


mi 


no 


ar 


ep 


iv 


or 


pa 


ne 


ni 


po 


as 


eq 


um 


OS 


ra 


pe 


pi 


ro 


at 


er 


un 


ot 


sa 


re 


ri 


so 


au 


es 


up 


ou 


ta 


se 


si 


to 


va 


ev 


ur 


ov 


va 


te 


ti 


vo 


aw 


ex. 


us 


ow 



Test XIV. "Ebhinghaus Mutilated Text." 

(The subject was first shown what was to be done on a sample sheet 
similar to the ones given below.) 

Fill in each blank with the word that will make the best sense. Do the 
work as well and as quickly as you can. Put only one word in each blank 
space. 

Test XIV, a. Ebhinghaus Mutilated Text. 

Park Hill on the Hudson offers you a solution of the home. problem 
to-day. No home seeker or investor can afford to ignore its claims. 
Escape the wear and tear of the city's noise and rush in this open air para- 
dise, just at the city's edge, in all respects an ideal home location for your- 
self and family. 

There are cottages containing every improvement waiting for you to 
step in and make yourself comfortable. It not only commands the most 
beautiful view around New York but is protected for all time against 
intrusion. Choice lots now selling on very easy terms. 

Test XIV, b. Ebbinghaus Mutilated Text. 

We believe we can prove to you that this investment is so secure and 
the dividends so sure, that it justifies you in withdrawing money from 
the Savings Banks, where it is earning 3>4% and putting it in our busi- 
ness where it will earn y%. We are a New England enterprise, managed 
hy New England men, and we have behind us a record of fourteen years 
of unbroken success. Whether you have much or little you cannot afford 
,to let slip this opportunity of doubling the income from your savings. 
Prompt action in this matter will repay you well. 

Test XIV, c. Ebbinghaus Mutilated Text. 

On the contrary, it didn't cost me a dollar. In fact, though at times 
I have found myself possessed of considerable sums of ready money, I 
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have never been a man of property in the strict sense of the word. I 
abandoned my profession, the law, as I did not find its practice so lucra- 
tive as I had hoped. For some years thereafter I travelled largely on the 
Mississippi River. It was the decline in steamboating and the adoption of 
less leisurely methods of travel that cut into my income and forced me 
to come North and engage in trade. 

Test XIV, d. Ebbinghaus Mutilated Text. 

The occult in everyday affairs is the theme of this new book by Robert 
Chalmers. Opening one of the thrilling stories of ■which the volume is 
composed is the tale of some awful mysterious happening, some super- 
natural event beyond the power of material reasoning of mortal man to 
explain, which comes into the life of some ordinary, everyday man. The 
opening chapter tells of a dinner given to a man deeply versed in occult- 
ism by his American friends. To these he giv^s many hints and sugges- 
tions of momentous things which he can plainly see waiting for them in 
the future. 

Test XIV, e. Ebbinghaus Mutilated Text. 

I asked the slovenly, but cheerful female who answered the bell for 
the landlady; wondering the while what I should say when I was asked 
for references. The merriment had not been called forth by anything 
amusing in my appearance, as my vanity had feared, but by a story which 
a man sitting at the head of the table was just finishing. The only vacant 
chair in the room was beside him, and, rather awkwardly, for I felt that 
they were taking my measure, I made my way toward it. As I sat down 
he greeted me with a polite bow. 

Test XIV, f. Ebbinghaus Mutilated Text. 

If we are perfectly well, thoroughly sound, we need' not be depressed. 
The perfectly healthy animal has no worries. The remedy has already 
been indicated. Regretfully it is so simple that very few people take the 
trouble to apply it. When it is clearly and widely recognized that worry 
is stupid, that its cure is simple where there is no organic trouble, worry 
will cease. Worry is simply a form of what for the sake of a nice large 
word, is called " neurasthenia," nerve-depletion. 

Given plenty of recreation, plenty of fresh air, and the normal man will 
not worry. 

Test XIV, g. Ebbinghaus Mutilated Text. 

We confess to something of sympathy with the correspondent who 
hinted yesterday that when children are run over and killed by automo- 
biles, the fault is not always that of the automobilist, but sometimes rests 
in some measure on those who do not teach their children to avoid un- 
necessary danger. It is a plain fact, of course, that public highways are 
for the use of the whole population, and that the automobilist is under 
every obligation to keep the limitations of his rights and privileges in 
mind as he goes along, but the road is his as well as other people's. 
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Test XIV, h. Ebbinghaus Mutilated Text. 

A law in defence of property rights in the broadest sense if observed 
would almost abolish international conflicts. Gentlemen do not fight with 
fists in money differences nor do they refer them to courts of honor. Civil 
courts are for that purpose and are as useful for nations as for men. 
The sanction of international law must be merely moral, for a long time 
at least. But in order that there should be any moral sanction there must 
be a moral code. The principles of such a code are deducible from 
treaties to which nations have set their hands and seals. 
Test XV. "Absurdities." 

As quickly as possible mark each sentence that contains an absurdity 
or impossibility. For instance, if a sentence stated or implied that ordi- 
nary lead was floating on water, mark such a sentence as impossible or 
absurd. Do not mark the sentences that contain no absurdity or imposssi- 
bility. 

Test XV, a. Absurdities. 

1. Though armed only with his little dagger, he brought down his 
assailant with a single shot. 

2. Silently the young dude hurried on, in spite of the darkness, and 
went splash into a puddle on the roadside. 

3. Having reached the goal I looked back and saw my opponents still 
running in the distance. 

4. While walking backwards he struck his forehead against a wall, and 
was knocked insensible. 

5. Offended by his obstinate silence, she refused to listen to him further. 

6. With his sword he pierced his adversary who fell dead. 

7. The one-armed cripple was attacked by a dog, which seized his 
wrist, but he pushed it off with the other hand. 

8. While forcing my way through the crowd, I came suddenly upon an 
old friend. 

Test XV, b. Absurdities. 

1. The dogs pursued the stag through flower gardens in full bloom. 

2. The storm which began yesterday morning, has continued without 
intermission for three days. 

3. That day we came in sight of several icebergs that had been entirely 
melted by the warmth of the Gulf Stream. 

4. While sharpening his three-bladed knife, my cousin cut his middle 
finger. 

5. My friend pointed out the North Star clearly visible on our right as 
we walked briskly eastward in the moonlight. 

6. The red haired girl, standing in the corner, is taller than any of her 
brothers. 

7. The two towns were separated only by a narrow stream, which was 
frozen over all winter. 

8. Fearing that he might waken her patient by his impudent talk, the 
nurse gave the detested dummy what he wished. 
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Test XV, c. Absurdities. 

1. After dressing herself carefully and elaborately, she descended to 
the breakfast-room, only to find it deserted. 

2. Preferring a tarnished reputation to the probability of becoming a 
corpse for the rest of his life, the young soldier took to flight. 

3. Upon very careful testing it has been found that a pint of cream 
weighs slightly more than a pint of milk. 

4. The old soldier energetically shouldered his crutch like a gun, as 
he talked by the fireside. 

5. In the busiest sections of New York City, cheap houses, like rare 
jewels, are scarce and expensive. 

6. In the ruins of an ancient Roman city there has recently been dis- 
covered a small skull believed to have been that of Pontius Pilate when 
he was about ten years old. 

7. We serve hot and cold lunches on five minutes' notice, to first and 
second class passengers. 

8. In my excitement I caught a glimpse of the sharp features of my 
enemy, who had just passed around the corner. 

Test XV, d. Absurdities. 

1. Our horse grew so tired that finally we were compelled to walk up 
all the hills. 

2. The hands of the clock were set back, so that the meeting might 
surely close before sunset. 

3. In some states there is a law forbidding a man to marry his widow's 
sister. 

4. Don't go to unreliable real estate offices to be swindled, come in here. 

5. Owing to the lack of ready money, the shrewd financier was unable 
to take advantage of the rare bargains then offered. 

6. Travellers who cannot read should be directed to sources of reliable 
information by signs printed in conspicuous places along the roads of 
travel. 

7. With wrapt attention, though the audience was immense, the orator 
listened to the crowd addressing him. 

8. Our office boy has been coming early of late, for he was often be- 
hind before, because his watch was slow. 



